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ods and uses for such proteins and derivatized proteins. The invention 
is particularly suitable for use in the synthesis of optionally polymer- 
modified, synthetic bioactive proteins, and of pharmaceutical compo- 
sitions that contain such proteins. 
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Title of the Invention 

"Pseudo'^-Native Chemical Ligation 

Field of the Invention 

The present invention relates to methods and compositions for extending the 
5 technique of native chemical ligation to permit the ligation of a wider range of 
peptides, polypeptides, other polymers and otfier molecules via an amide bond. The 
invention further provides methods and uses for such proteins and derivatized 
proteins. 

Cross-Reference to Related Applications 

10 This application is a continuation-in-part of US Patent Applications Serial 

Nos. 60/231,339 (filed September 8, 2000) and 60/236,377 (filed September 29, 
2000), both of which applications are herein incorporated by reference in their 
entirety. 

Background of the Invention 

15 Over the past 30 years, medical attention has increasingly turned to the 

possibility of using naturally produced proteins as therapeutic drugs for the treatment 
of disease. 

Recombinant DNA techniques have become the primary method for 
commercial production of many polypeptides and proteins because of the large 

20 quantities that can be produced in bacteria and other host cells. Recombinant protein 
production involves transfecting or transforming host cells with DNA encoding the 
desired exogenous protein and growing the cells under conditions favoring 
expression of the recombinant protein. E. coli and yeast are favored as hosts because 
they can be made to produce recombinant proteins at hi^ titers (see, U.S. Patent 

25 5,756,672 (Builder et al). 
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Numerous U.S. Patents have been issued with respect to general bacterial 
expression of recombinant-DNA-encoded proteins (see, for example, U.S. Patents 
4,565,785; 4,673,641; 4,738,921; 4,795,706; 4,710,473). Unfortunately, the use of 
recombinant DNA techniques has not been universally successfiil. Under some 

5 conditions, certain heterologous proteins expressed in large quantities from bacterial 
hosts are precipitated within the cells in dense aggregates, recognized as bright spots 
visible within the enclosure of the cells under a phase-contrast microscope. These' 
aggregates of precipitated proteins are referred to as "refractile bodies," and can 
constitute a significant portion of the total cell protein (Brems et cd.y Biochemistry 

10 (1985)24:7662. 

Recovery of protein from these bodies has presented numerous problems, 
such as how to separate die protein encased within the cell from the cellular material 
and proteins harboring it, and how to recover the inclusion body protein in 
biologically active form. For general review articles on refractile bodies, see 

15 Marston, supra; Mitraki and King, Bio/Technology, 7: 690 (1989); Marston and 
Hartley, Methods m EnzymoL, 182: 264-276 (1990); Wetzel, "Protein Aggregation 
In Vivo: Bacterial Inclusion Bodies and Mammalian Amyloid," in Stability of 
Protein Pharmaceuticals: In Vivo Pathways of Degradation and Strategies for Protein 
Stabilization, Ahem and Manning (eds.) (Plenum Press, 1991); and Wetzel, 

20 **Enhanced Folding and Stabilization of Proteins by Suppression of Aggregation In 
Vitro and In Vivo," in Protem Engmeering-A Practical Approach, Rees, A. R. et al 
(eds.) (IRL Press at Oxford University Press, Oxford, 1991). A need therefore exists 
for an alternative way of producing bioactive proteins. 

One alternative to recombinant production of proteins involves the use of the 

25 principles of organic chemistry to synthesize proteins. Existing methods for the 
chemical synthesis of proteins include stepwise solid phase synthesis, and fragment 
condensation either in solution or on solid phase. The classic stepwise solid phase 
synthesis of Merrifield involves covalently linking an ammo acid corresponding to 
the carboxy-termmal amino acid of the desired peptide chain to a solid support and 

30 extending the polypeptide chain toward the amino end by stepwise coupling of 
activated amino acid derivatives having activated carboxyl groups. After completion 
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of the assembly of the fiiUy protected solid phase bound peptide chain, the peptide- 
solid phase covalent attachment is cleaved by suitable chemistry and the protecting 
groups removed to give the product polypeptide. 

There are unfortunately multiple disadvantages to the stepwise solid phase 
5 synthesis method, including the formation of solid-phase bound by products that 
result from incomplete reaction at the coupling and deprotection steps in each cycle. 
The longer the peptide chain, the more challenging it is to obtain high-purity well- 
defined products. The synthesis of proteins and large polypeptides by this route is a 
time-consuming and laborious task. 

10 The solid phase fragment condensation approach (also known as segment 

condensation) was designed to overcome the difficulties m obtaining long 
polypeptides via the solid phase stepwise synthesis method. The segmmt 
condensation method involves preparation of several peptide segments by the solid 
phase stepwise method, followed by cleavage from the solid phase and purification 

IS of these maximally protected segments. The protected segments are condensed one- 
by-one to the Sist segment, which is bound to the solid phase. Often, however, 
technical difBculties are encountered in many of the steps of solid phase segment 
condensation. See E. Atherton, et a/., "Solid Phase Fragment Condensation -The 
Problems," in Innovation and Perspectives in Solid Phase Synthesis 1 1-25 (R. Epton, 

20 et al. 1990). For example, the use of protecting groups on segments, to block 
undesired ligating reactions can frequently render the protected segments sparingly 
soluble, interfering in efBcient activation of the carboxyl group. Limited solubility 
, of protected segments also can interfere with purification of protected segments. See 
K. Akaji etd., Chem. Pharm. BuIl.(Tokyo) 33:184-102 (1985). Protected segments 

25 ' are dif5cult to characterize with respect to purity, covalent structure, and are not 
amenable to high resolution analytical ESMS (electrospray mass spectrometry) 
(based on charge). Racemization of the C-terminal residue of each activated peptide 
segment is also a problem, except if ligating is performed at Glycme residues. 
Moreover, cleavage of the fully assembled, solid-phase bound polypeptide from the 

30 solid phase and removal of the protecting groups frequently • can require harsh 
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chemical procedures and long reaction times that result in degradation of the fully 
assembled polypeptide. 

Segment condensation can be done in solution rather than on solid phase. See • 
H. Muramatsu et al, Biochem. and Biophys.. Res. Commn. 203(2):1 131-1139 
5 (1994). However, segment condensation in solution requires purification of 
segments prior to ligation as well as use of protecting groups on a range of different 
side chain functional groups to prevent multiple undesired side reactions. Moreover, 
the ligation in solution does not permit easy purification and wash steps afforded by 
solid phase ligations. Furthermore, the limitations with respect to solubility of 
10 protected peptide segments and protected peptide intermediate reaction products are 
exacerbated. 

Chemical- ligation of peptide segments has been explored m order to 
overcome ttie solubility problems frequently encountered with maximally protected 
peptide. Chemical ligation involves the formation of a selective covalent Imkage 

15 between a first chemical component and a second chemical component Unique, 
mutually nsactive, fimctional groups present on the first and second components can 
be used to render the ligation reaction chemoselective. For example, the chemical 
ligation of peptides and polypeptides mvolves the chemoselective reaction of peptide 
or polypeptide segments bearing compatible Unique, mutually-reactive, C-terminal 

20 and N-terminal amino acid residues. Several different chemistries have been utilized 
for this purpose, examples of which include native chemical ligation (Dawson, e/a/.. 
Science (1994) 266:776-779; Kent, et al, WO 96/34878; Kent, et al, WO 
98/28434), oxime forming chemical ligation (Rose, et al, J. Amer. Cham. Soa 
(1994) 116:30-34), thioester forming ligation (SchnSIzer, et al. Science (1992) 

25 256:221-225), thioetiier forming ligation (Englebretsen, et al, Tet Letts, (1995) 
36(48):8871-8874), hydrazone forming ligation (Gaertner, et al, Bioconj. ChenL 
(1994) 5(4):333-338), and thiazolidine forming ligation and oxazolidine forming 
ligation (Zhang, et a/., Proc, Natl Acad Sci. (1998) 95(16):9 184-9 189; Tarn et al, 
WO 95/00846; US Patent No. 5,589,356) or by other methods (Yan, L.Z. and 

30 Dawson, P.E., "Synthesis of Peptides and Proteins without Cysteine Residues by 
Native Chemical Ligation Combined witii Desulfiirization," J, Am. Chem, Soc. 2001^ 
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123, 526-533, herein incorporated by reference;" Gieseinan et al. Org. Lett 2001 
3(9):133M334; Saxon, E. et a/., *Traceless" Staudinger Ligation for the 
Chemoselective Synthesis of Amide Bonds. Org. Lett. 2000, 2, 2141-2143). 

Of these methods, only the native chemical ligation approach yields a 
5 ligation product having a native amide (i.e. peptide) bond at the ligation site. The 
original native chemical ligation methodology (Dawson et al., supra; and WO 
96/34878) has proven a robust methodology for generating a native amide bond at 
the ligation site. Native chemical ligation involves a chemoselective reaction 
between a first peptide or polypeptide segment having a C-terminal a- 

10 carboxythioester moiety and a second peptide or polypeptide having an N-terminal 
cysteine residue. A thiol exchange reaction yields an initial thioester-Unked 
intermediate, which spontaneously rearranges to give a native amide bond at the 
ligation site while regenerating the cysteine side chain thiol. The primary drawback 
of the original native chemical ligation sq^proach is that it requires an N-terminal 

15 cysteine, i.e., it only permits the joming of peptides and polypeptide segments 
possessing an N-teiminal cysteine. 

Notwithstanding this drawback, native chemical ligation of peptides with N- 
terminal amino acids other than cysteine has been reported (W098/28434). In this 
approach, the ligation is performed using a first peptide or polypeptide segment 

20 having a C-tenninal a-carboxytfaioester and a second peptide or polypeptide segment 
having an N-terminal N-{thiol-substituted auxiliary} . gixjup represented by the 
formula HS-CH2-CH2-0-NH-[peptide]. Following ligation, the N-{thiol substituted 
auxiliary} groiqj is removed by cleaving the HS-CH2-CH2-0-auxiIiary group to 
generate a native amide bond at the ligation site. One lunitation of this metiiod is 

25 that the use of a merc^toethoxy auxiliary group can successfully lead to amide bond 
fomiation only at a glycme residue. This produces a ligation product tihat upon 
cleavage generates a glycine residue at the position of tiie N-substituted amino acid 
of the second peptide or polypeptide segment. As such, this raibodiment of the 
method is only suitable if one desires the ligation product of the reaction to contain a 

30 glycine residue at this position, and in any event can be problematic with respect to 
ligation yields, stability of precursors, and the ability to remove the O-linked 
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auxiliary group. Although other auxiliary groups may be used, for example the 
HSCH2CH2NH-[peptide], without limiting the reaction to ligation at a glycine 
residue, such auxiliary groups cannot be removed from the ligated product. 

Accordingly, what is needed is a broadly applicable and robust chemical 
5 ligation system that extends native chemical ligation to a wide variety of different 
amino acid residues, peptides, polypeptides, polymers and other molecules by means 
of an effective, readily removable thiol-containing auxiliary group, and that joins 
such molecules together \vith a native amide bond at the ligation site. The present 
invention addresses these and other needs. 

10 Brief Description of the Figures 

Figure 1 illustrates the overall reactions of pseudo native chemical ligation. 

Figure 2 illustrates the overall process of preparing the synthetic bioactive 
proteins of the present invention. 

Figure 3 depicts tiie basic structure of a preferred type of synthetic 
1 5 erythropoiesis stimulating protems. 

Figure 4 shows the general structure of circularly permuted SEP analogs 
having no free amino or carboxy terminus. The pPEG ^precision PEG" as 
described herein) attachment sites are positionally the same- relative to the SEP 
analogs. The SEP analogs are circularized by an oxune linkage between the new N- 
20 /C-terminal positions, such as P126C and A125X. The total molecular weight of the 
SEP analogs will range from about 50-80 kDa, depending on the pPEG utilized for 
modification of a given analog, with the pPEG mediated hydrodynamic MW 
estunated to be greater than about 100 kDa. 

Figure 5 shows the basic structure of synthetic bioactive GCSF protefais. In 
25 the figure, "J" designates a non-natiirally encoded residue having a hydrophobic side 
chaui. 

Figure 6 shows the structure of preferred synthetic RANTES analogs. In the 
Figure, NNY « nonanoyl, X = non-naturally encoded ammo acid havmg a 
hydrophobic side chain, pPEG and FA = fetty acid 
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Figure 7 depicts schematically the foimation of branched-chain pPEG 
polymer. 

Summary of the Invention 

The present mvention concerns methods and compositions for extending the 
5 technique of native chemical ligation to permit lEhe ligation of a wider range of 
peptides, polypeptides, other polymers and other molecules via an amide bond. The 
invention further provides methods and uses for such proteins and derivatized 
proteins. The invention is particularly suitable for use in the synthesis of optionally 
polymer-modified, synthetic bioactive proteins, and of pharmaceutical compositions 
10 that contain such proteins. - 

hi detail, the invention provides a synthetic protein (especially a protem 
having a monomer molecular weight greater than about 25 kDa) containmg a 
pseudo-amino acid residue whose side chain has the formula: -S- R^a, where Raa is 
an optionally substituted termiaal portion of a ribosomally-specified amino acid side 
15 chain, or an analog of the terminal portion of a ribosomally-specified ammo acid 
side chain. 

The invention particularly provides the embodiment of such a synthetic 
protein wherein the protein has a biological activity possessed by a ribosomally- 
produced bioactive protein. 

20 The invention further concerns such synthetic proteins wherem a plundity of 

amino acid residues of the protein axe bound to adjacent amnio acid residues by an 
amide bond, and such synthetic proteins wherein a plurality of amino acid residues 
whereui at least one of such amino acid residues is bonded to an adjacent amino acid 
residue by a non-amide bond (e,g., .a thioester bond, a thioether bond, and an oxime 

25 bond). 

The invention further concerns such synthetic proteins wherein the pseudo- 
ammo acid residue is a D-configuiation amino acid residue or is an an L- 
configuration amino acid residue, or wherem such synthetic protems contain both 
pseudo-amino acid residue(s) in the D-configuration and pseud-ammo acid 
30 Fesidue(s) in die L-configuration. 



wo 02/20034 



PCT/USOl/21935 



The invention further concerns such synAetic proteins wherein the pseudo- 
amino residue is selected from the group consisting of a pseudo-arginine; a pseudo- 
asparagine; a pseudo-aspartate; a pseudo-dopamine; a pseudo-glutamate; a pseudo* 
glutamine; a pseudo-histidine; a pseudo-isoleucine; a pseudo-Ieucine; a pseudo- 
5 lysine; a pseudo-metiiionine; a pseudo-phenyalanine; a pseudo-serine; a pseudo- 
threonine; a pseudo-tiyptophan; a pseudo-tyrosine; and a pseudo-valine. 

The invention further concerns such synthetic proteins wherein the 
ribosomally-specified bioactive protein is a mammalian protein (e.g., .a human, 
simian, bovine, murine, porcine, ovine, or equine protein). 
10 The invention further concerns such synthetic proteins wherein the protein 

has a bioactivity selected of a protein receptor or fragment thereof, of a protein 
receptor ligand or fragment thereof; or of a cytokine (especially wherein the cytokine 
is selected from the group consisting of an interleukin, a lymphokine, a RANTES 
protein, an erythropoiesis stimulating protein, tumor necrosis factor (TNF), an 
1 5 interferon, a growth factors and a single peptide hormone). 

The invention further concerns the synthetic protems SEP-0, SEP-1, and 

SEP-3. 

The invention further concerns all such synthetic protehis wherein one or 
more of their amino acid residues are modified by one or more polymer adducts, and 
20 particularly wherein the synthetic protein comprises one or more amino acid residues 
that are modified by one or more polymer adducts at amino acid residue(s) that 
- correspond to at least one of the glycosylation sites of a ribosomally-encoded 
bioactive protein. 

The invention also provides a molecularly homogeneous phannaceutical 
23 composition comprismg a ^tiietic protein, wherein the protein possesses a 
biological activity that mimics a biological activity associated with a ribosomally- 
specified bioactive mammalian protein, and has a monomer molecular weight 
greater than about 25 kDa, and contdns a pseudo-amino acid residue whose side 
chain has the formula: -S-Raa,. where Raa is selected from the group consisting of an 
30 optionally substituted terminal portion of a ribosomally-specified amino acid side 
chain, or an analog thereof 
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The invention also provides a method of treating a human disease or 
condition that comprises administering to an individual in need of such treatment an 
effective amount of a pharmaceutical composition comprising one or more 
molecularly homogeneous pharmaceutical compositions each comprising a synthetic 
5 protein, wherein the synthetic protein has a monomer molecular weight greater than 
about 25 kDa, and contains a pseudo-ammo acid residue whose side chain has the 
formula: -S-R«a, where iUa is selected fix)m the group consisting of an optionally 
substituted termmal portion of a ribosomally-specified amino acid side chain, or an 
analog thereof, the syntiietic protein possessing a biological activity that mimics a 
10 biological activity of a ribosomally-specified bioactive human protein receptor or 
fiagment thereof protem receptor ligand or fragment thereoi^ or a cytokine. 

The mv^tion also provides a method for synthesizing a desked polypeptide, 
and the desued polypeptide made ±roug^ such method, wherein the desired 
polypeptide has the formula: 

15 aaNHj-Q-aax-aay-W-aacooH 

where Q and W each denote the optional presence of one or more additional amino 
acid residues, aa^H^ ^l^otes the N-terminal amino acid residue of the polypeptide; 
aax and aa^ denote internal adjacent amino acid residues, having side chains x and y, 
respectively, and aacooH denotes the C-terminal ammo acid residue of the 

20 polypeptide; the method comprising: 

(A) ligating a first peptide havmg the formula: aaNH2-Q-aax-C0SR, 

wherein R is any group compatible with the thioester group, 
including, but not limited to, aryl, benzyl, and alkyi groups, to a 
second peptide having the formula: Cys-W-aacooH to thereby form 
25 the polypeptide: aaNH^-Q-aax-Cys-W-aacooH; and 

• (B) incubating the polypeptide in the presence of a reagent Raa-X, where 
Raa is a group whose structure munics the terminal portion of the side 
chain of amino acid residue aay; and X is a good leavmg group 
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(especially a halogen, such as F, t or Brj; the incubation being under 
conditions sufficient to form the desired polypeptide. 
The invention fijrther provides tfie embodiment of such method and 
polypeptide wherein the amino acid residue aay is selected from Ae group consisting 
5 of a pseudo-arginine; a pseudo-asparagine; a pseudo-aspartate; a pseudo-dopamine; 
a pseudo-glutamate; a pseudo-glutamine; a pseudo-histidine; a pseudo-isoleucine; a 
pseudo-leucine; a pseudo-Iysine; a pseudo-methionine; a pseudo-phenyalanine; a 
pseudo-serine; a pseudo-threonine; a pseudo-tryptophan; a pseudo-tyrosine; and a 
pseudo-valine. 

10 Description of the Preferred Embodiments 

The present invention concerns methods and compositions for extending the 
technique of native chemical ligation to permit the ligation of a wider range of 
peptides, polypeptides, other polymers and other molecules via an amide bond, thus 
facilitating the chemical synthesis of such molecules. The invention further provides 
15 methods and uses for such proteins and derivatized proteins. The invention is 
particularly suitable for use in the synthesis of optionally polymer-modified, 
synthetic bioactive proteins, preferably having a monomer molecular weight of 
greater than about 25 kDa, and of pharaiaceutical compositions that contain such 
proteins. 

20 L The Chemical Synthesis of Bioactive Peptides and Proteins 

Typically, the synthesis of bioactive peptides and proteins employs the 
. '*Merrifield"-chemistry stepwise solid phase peptide synthesis protocol developed in 
the early 1960's, using standard automated peptide synthesizers. Such synthesis may 
employ solid or solution phase ligation strategies. While such chemistry may be 
25 readily employed to produce many polypeptides, it is unsuitable for the production 
of protems or large polypeptides due to associated yield losses, by-product 
production, and incomplete reactions. To address these limitations, techniques of 
chemical ligation have been developed ftat permit one to ligate together preformed 
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peptide fragments in order to achieve the synthesis of larger polypeptides and 
proteins. 

Chemical ligation involves the formation of a selective covalent linkage 
between a first chemical component and a second chemical component Unique, 
5 mutually reactive, functional groups present on the first and second components can 
be used to render the ligation reaction chemoselective. For example, the chemical 
ligation of peptides and polypeptides involves the chemoselective reaction of peptide 
or polypeptide segments bearing compatible Unique, mutually reactive, C-terrainal 
and N-termmal amino acid residues. Several different chemistries have been utilized 

10 for this purpose, examples of which include native chemical ligation (Dawson, et al. 
Science (1994) 266:776-779; Kent, et al, WO 96/34878; Kent, et a/., WO 
98/28434), oxime forming chemical ligation (Rose, et ai, J. Amer. Chem, Soc. 
(1994) 116:30-34), thioester . formmg ligation (SchnSlzer, et a/.. Science (1992) 
256:221-225), thioether forming ligation (Englebretsen, et aL, Tet. Letts. (1995) 

15 36(48):8871-8874), hydrazone forming ligation (Gaertner, et al., Bioconj. Chem, 
(1994) 5(4):333-338), and thiazolidine forming ligation and oxazolidine forming 
ligation (Zhang, et a/., Proc, Natl Acad Set (1998) 95(16):9184-9189; Tarn, et al, 
WO 95/00846; US Patent No. 5,589,356) or by other metiiods (Yan, L.Z. and 
Dawson, P.E., "Synthesis of Peptides and Proteins without Cysteine Residues by 

20 Native Chemical Ligation Combined with Desulfiirization," J. Am. Chem. Soc. 200 1 , 
123, 526-533, herein incorporated by reference; Gieselnan et al. Org. Lett, 2001 
3(9):133M334; Saxon, E. et a/., •Traceless" Staudinger Ligation for the 
Chemoselective Synthesis of Amide Bonds. Org. Lett 2000, 2, 2141-2143). 

Where the ligation involves the joining of a polypeptide that possesses an N- 

25 terminal cysteine residue, the procedure of native chemical ligation is preferably 
employed (Dawson, et al. Science (1994) 266:776-779; Kent, et al., WO 96/34878; 
Kent, et al, WO 98/28434; U.S. Patent Application Serial No. 09/097,094, all herem 
incorporated by reference). Native chemical ligation mvolves a chemoselective 
. reaction between a first peptide or polypeptide segment having a C-terminal a- 

30 carboxythioester moiety and a second peptide or polypeptide having an N-terminal 
cysteine residue. A thiol exchange reaction yields an initial thioester-Iinked 
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intermediate, which spontaneously rearranges to give a native amide bond at the 
ligation site while regenerating the cysteine side chain thiol. In many instances, the 
sequence of the natural protein will comprise suitably placed cysteine residues such 
that polypeptide fragments having an N-terminal cysteine residue may be 
5 synthesized and used in a native chemical ligation reaction. 

Although this methodology has proven to be practical, robust and useful for 
the synthesis of large polypeptides and proteins, its requirements for a cysteine 
residue at the site of ligation limits its applicability. Cysteine is one of the least 
common amino acids. A typical protein domain comprises 150-200 ammo acids; 
10 many proteins contain regions of greater than 60 amino acids in length that contain 
no cysteine residues. One solution to this problem involves designing a non- 
naturally occurring protem in which one or more of the protein's naturally occurring 
amino acid residues have been replaced with cysteine residues, thus permitting the 
native chemical ligation method to be employed. This solution is often complicated 
1 5 by the unpredictable effects that the introduction of a cysteme residue may have on 
protein structure or function. Such cysteine residues may be used to covalently 
immobilize the synthesized protem. 

The present invention provides two alternative solutions to this problem that 
may be employed separately or conjunctively. The first such solution (temed herem 
20 'Tseudo-Native Chemical Ligation") involves the use of non-naturally occuiring 
pseudo-amino acid residues at preselected positions in the peptides employed in the 
protein synthesis. The structures of such pseudo-amino acids mimic both the 
structures of cysteine and the structiros of the amino acids that are naturally found at 
such preselected positions in the protein being synthesized. The second such 
25 solution (termed herein "Extended Native Chemical Ligation") is described in U.S. 
Patent Application Serial No. 60/231,339 (Kent, et al, filed: September 8, 2000), 
herein incorporated by reference, and mvolves ligating a first component comprising 
a carbojQ^l thioester, and more preferably, an a-carboxyl thioester with a second 
component comprising an acid stable N-substituted, and preferably, Na-substituted, 
30 2 or 3 carbon chmn ammo aUq^l or aiyl thiol. Each' of these solutions is discussed in 
detail below. 
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A. Pseudo-Native Chemical Ligation 

Pseudo-native chemical ligation is directed to the thioalkylation of cysteine 
side chains generated at ligation sites from native chemical ligation. A preferred 
aspect is thioalkylation of cysteine ligation sites wherein at least one peptide 
5 contains a native cysteine having its thiol side chain protected with a suitable 
protecting group. 

In a preferred embodiment of the invention, the thiol moiety of a cysteine 
group is modified into a desired side chain, for example, into the side chain of a 
ribosomally-specified amino acid, an analog of such an amino acid, or into a non- 
10 ribosomally-specified amino acid. As used herein, a ribosomally-specified amino 
acid is an amino acid that is recognized by ribosomes in the process of protein 
translation and can be incorporated mto a ribosomally produced protein. 
Considerable published literature exists describmg chemical modifications of the 
cysteine side chain thiol moiety (see, e.g., "Current Protocols in Proteua Science," 
15 Edited by: John E. Coligan et a/., John Wiley & Sons, NY (2000)). Kaiser, E.T. has 
described the conversion of cysteine residue side chains to munic the chemical 
properties of a naturally occurring amino acid side chain (see, e.g., Kaiser, E.T. et 
fl/., "Chemical Mutation Of Enzyme Active Sites," Science, 1984 Nov 
2;226(4674):505-ll). Additionally, flie use of a cysteine side chain to introduce a 
20 . label mto a peptide or protein has been described. Cysteine side cham modifications 
are reviewed in Chemistry of Protein Conjugation and Crosslinkmg, S. S. Wong, 
(1991, CRC Press); Chemical Modification of Proteins, Gary E; Means et al, (1971, 
Holden-Day), Chemical Modification of Proteins: Selected methods and analytical 
procedures, Glazer, A.N. et d. (1975, Elsevier); Chemical Reagents for Protein 
25 Modification, RL Lundblad (1991, CRC Press). Tam et al. (Biopolymers (1998) 
46:319-327) have disclosed the use of homocysteine (-CH2-CH2-SH) for non-cys 
native chemical ligation, followed by tiiioallcylation using methyl p- 
nitrobenzenesulfonate (methylating reagent) to convert the homocysteine side chain 
to a native methiomne side chain (-CH2-CH2-S-CH3). The present invention also 
30 can be used for converting homocysteines to pseudo amino acids as well, i.e., to 
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amino acids other than methionine. However, as with tfie conversion of cysteines 
described herein, in accordance with the present invention it is necessary to use 
protecting groups to avoid destruction of native cysteines involved in disulfide 
pairing for peptides that contain at least one native cysteine that one does not wish to 
5 convert. Suitable protecting groups are described below. 

While the method of pseudo-native chemical ligation does not facilitate the 
municking of the side chains of certain ribosomally-specified amino acids (e.g., the 
side chains of glycine, alanine, valine, and proline) (alanine's side chain can, 
however, be formed through a desulfurization reaction (Yan, L.Z. and Dawson, P.E., 

10 "Synthesis of Peptides and Proteins without Cysteine Residues by Native Chemical 
Ligation Combined with Desulfurization," J, Am, Chem. Soc. 2001, 123, 526-533, 
herein incorporated by reference), it may be used to form side chains that mimic 
many ribosomally-specified or non-encoded amino acids. Amino acids produced in 
accordance with the pseudo-native chemical ligation method of the present invention 

1 5 will contain a thioether linkage, and will have no beta-branching (in that they will all 
include a methyl group at the beta position, i.e., aa-pCH2-S-. Thus, the pseudo- 
amino acid versions of the beta-branched amino acids, isoleucine'and threonine can 
be made to have the pendant side chain structure, without having the beta geometry 
and its attendant constraints. 

20 Significantly, the methods of the present invention may be used to form 

amino acid side chains that are the same length as that of ribosomally-specified 
amino acids, or are longer or shorter than such length. Such alteration in side oham 
length can be used to stabilize (or destabilize) the three-dunensional conformation to 
increase protein stability (or to enhance the ability of the protein to alter its 

25 conformation and thereby accept a different range of substrates, mhibitors, receptors, 
ligands, etc, relative to those accepted by the naturally occurring, protein. For 
example, Cys-CHi-SH + Br-CH2-COOH yields Cys-CH2-S-CHrC0OH (such 
"pseudo-glutamic acid" has one additional side cham atom, namely the -S- group; 
alternatively, if used in the place of aspartic acid, it will possess two additional side 

30 chain atoms, namely a -CH2-S- group). Other side chains have the same number of 
atoms in the side chain, but differ by inclusion of the thioether linkage (-S-). For 
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example, Cys-CH2-SH + Br-CH2-CH2-NH-?G, iFollowed by removal of PG yields 
Cys-CH2-S-CH2-CH2-NH2. The resulting structure has no additional atoms in the 
side chain, but one -CH2- group is replaced with -S-. Methionine is another example 
here, Cys-CH2-SH + K;H2-CH3 yields Cys-CH2-S-CH2-CH3 (versus native met 
5 structure of Met-CH2-CH2-S-CH3); thus the thioether is relocated. Arginine also: 
Cys-CH2-SH + Br-CH2-NH-CH((.NH2)(-NH2t)) yields Cys.CH2^S-CH2-NH-CH((. 
NH2)(= NH2^). Preferably, protection of reactive amino groups, particularly for the 
constructing pseudo lysine can be employed to avoid unwanted side reactions. Once 
the thibalkylation reaction is performed, the protecting group can be removed. 

10 In general, where the desire is to mfanic a naturally occurring protein as 

closely as possible, it is most preferred to employ a pseudo amino acid molecule 
having a side chain length that is the same length as that of the ribosomally-specified 
amino acid normally present at such position in the protein; it is less preferred to 
employ a pseudo amino acid molecule having a side chain length that is one atom 

15 longer than that of the ribosomally-specified amino acid, and still less preferred to 
employ a pseudo amino acid molecule having a side chain length.that is two atoms 
longer than that of the ribosomally-specified amino acid. Moreover, its is preferred 
to select a cysteine ligation site that is in a location where genetic changes are not 
likely to disrupt fimction or where ammo acids at that site in related proteins are 

20 conserved. Such sites can be identified by alanfaie scanning, homology modelmg, 
and other methods. 

In pseudo-native chemical ligation, a peptide containmg an amino terminal 
cysteine residue is ligated to a peptide having a carboxy terminal thioester, as in 
native chemical ligation. The thiol side chain of the cysteine is then reacted wifli a 

25 compound of the formula Raa-X, where X is a good leaving group,, and Raa is a group 
whose structure mimics the terminal portion of the side chain of an ribosomally- 
specified or synthetic amino acid. 

Significantly, the reactions of pseudo-native chemical ligation work with 
either the natural L-configuration of cysteine side chain, or the D-configuration. Use 

30 of the D-configuration can unpart protease resistance at that ligation site, and thus 
may be desired when mcreased stability to proteolysis is desured. However, in using 
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the D-cysteine, the backbone structure at that side will be altered. Such alteration, in 
addition to protease resistance, may be desired to alter bioactivity. However, to 
minimize impact on bioactivity, it is preferred to locate the D-cysteine at a site of 
high flexibility, such as a disorded region, such as at a disorded loop that will be 
5 located on the surface of the resulting folded molecule, on at a disorded terminus of 
the molecule. Desirably, the reactions of pseudo-native chemical ligation may be 
used to place large, charged side chains (e.g., the side chains of Lys, Arg, Asp or 
Glu) on the surface of the synthesized molecule. 

Examples of suitable good leaving groups, X, include halogens, especially 

10 Iodine and Bromine. Examples of R^a groups include PO4, COOH, COO, CONH2, 
guanidinium, amine, alkyl, substituted alkyl, aryl, substituted aryl, imidazole, 
alkylated imidazole, indole, or alkylated indole groups. 

The selection of which Raa to employ will depend upon the amino acid side 
chain desired to be present at a particular position. Thus, for example, a desired 

15 polypeptide or protein havmg the amino acid sequence; 

aaNH2-Q-aax-aay-W-aacooH 

where Q and W each denote the optional presence or absence of additional 
amino acid residues, and aax and aay denote internal adjacent residues (having side 
chains x and y, respectively), and aaNHj and respectively denote the amino (N-) 
20 termimal residue and the carboxy (C-) terminal residue of the polypeptide or protein 
an be synthesized by preparing two peptide fragments: 

aaNH2-Q-aax-C0SR and Cys-W-aacooH 

where Cys denote the replacement of aay with cysteine, and R is any group 
compatible with the thioester group, including, but not Ifanited to, aryl, benzyl, and 
25 alkyl groups. Examples of R mclude 3-carboxy-4-nitrophenyl thioesters, benzyl 
esters, and mercaptoproprionic acid leucine esters (See, e.g., Dawson et al. Science 
(1994) 266:776-779; Canne et d. Tetrahedron Lett. (1995) 36:1217-1220; Kent, et 
al, WO 96/34878; Kent, et al, WO 98/28434; togenito et al, JACS (1999) 
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121(49):n369-n374; and Hackeng et d., Proc. Natl. Acad. Sci. U.S.A. (1999) 
96:10068-10073). Other examples include dithiothreitol, or alkyi or aryl thioesters, 
which can be produced by intein-mediated biological techniques, which also are well 
known (See, e.g., Chong et al. Gene (1997) 192:277-281; Chong et al., Nucl. Acids 
5 Res. (1998) 26:5109-5115; Evans et al. Protein Science (1998) 7:2256-2264; and 
Cotton et a/.. Chemistry & Biology (1999) 6(9):247-256); and then ligating the 
fragments together to form: 

. aaNHj-Q-aax-Cys-W-aacooH. 

The ligated fragment is then reacted with Ry-X, where Ry is a side group that 

10 mimics the structure of the y side chain). The reaction is conducted under conditions 
sufficient to convert the thiol group of the cysteine into a "pseudo-y" side chain. For 
example, if R^ is selected to be CH2-COOH or (CH2)2-COOH, then the reaction will 
lead to the formation an amino acid residue that mimics the structure and function of 
aspartic acid C*pseudo-Asp") or glutamic acid ("pseudo-Glu"). As will be 

15 appreciated, in light of the above description, more complicated synthesis can be 
conducted employing more than two peptide fragments. 

Native chemical ligation involving "pseudo amino acids" according to the 
present mvention significantly expands the applicability of the native chemical 
ligation method for the chemical synthesis of proteins. Native chemical ligation 

20 involving "pseudo amino acids" uses the same chemistry in the ligation step, 
forming a peptide bond at the Cys residue with a native Ihiol-contafaiing side cham 
fimctionalitjr, in a unique second step, the native thiol-containing side chain of the 
Cys at the site of ligation* is converted by chemoselective reaction to yield a non- 
native side chain that contams the same fim'ctional group as a genetically-encoded 

25 ammo acid, but a non-native side chain. 

For example, the Q^ine at the ligation site can be reacted with bromoacetic 
acid (or otiier haloacetic acid) at neutral pH in water; only the unprotected thiol of 
the Cys at the ligation site reacts under these conditions to give a polypeptide 
containing an S-carboxymethylated residue at the ligation site, viz: 
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This non-native amino acid contains a carboxyl moiety on the side chain, the 
same functional group as a glutamic acid or an aspartic acid residue. The S- 
carboxymethylated Cys is referred to herein as a ''pseudb glutamic acid" C^seudo- 
glutamate," '*pseudo-Gltf 0 (where one extra side chain atom is present and is the -S- 
group forming a thioether Imkage) (it also may be referred to as a "pseudo-aspartic 
acid" where two extra side chain atoms are present, one of which is the -S- group 
forming a thioether linkage). For many proteins, the simple preservation of the 
charged carboxyiate side chain even in this '*pseudo amino acid" will be sufficient to 
retain or impart desired properties to the protein. A key difference is that it functions 
as a native chemical ligation site at a position devoid of a suitable cysteine. 

Use of other reagents to modify the side chain thiol in Cys residues at the 
ligation site(s) can give other "pseudo ammo acid" residues. For example, reaction 
10 of the thiol with a haloacetamide, e.g. bromoacetamide, will give a "pseudo 
glutamine" at the ligation site, viz: 
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H2NOC' 



Similarly, reaction with an N-protected haloalkylamine, e.g. 

O 




5 followed by removal of the amino protecting group will give a *pseudo-lysine' 
residue^ viz: ■ 




Reaction of the Cys residue(s) at the ligation site(s) with a suitable haloalkane, e.g. 



will yield a 'pseudo leucine' amino acid, viz: 
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fr 




Other suitably chosen aiyl-containing halides, where R can be H, OH, aryl etc., e.g. 



can be used similarly to generate pseudp amino acids corresponding to Phe, Tyr, or 
Trp residues, viz: 



10 



15 




20 



A significant feature of this approach is that cysteine residues that one does 
not wish to modify in the reacting segments be side chain protected, e.g. as 
Cys(Acm), to prevent chemical modification of Cys residues other than the one(s) at 
the ligation site(s), or that any other Cys residues in the reacting segments be 
intended for the simultaneous foraaation of identical *pseudo amino acids' by the 
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chemical modification reaction after ligation. It will be appreciated tfiat tfie same 
principles apply to the use of homocysteine at the ligation site. It will also be 
appreciated that selenocysteine can be employed at a ligation site through 
selenocysteine-mediated native chemical ligation, and converted by similar 
5 alkylation reactions to generate pseudo amino acids of the invention, where a 
selenoether group replaces the thioether group. 

As used herem, the symbol cp denotes a benzyl group; IM denotes an 
hnidazole group, and IN denotes an indole group; PG denotes a protecting group. 
Below is a sunmiary of Raa side groups that may be used to synthesize peptides 
10 containing pseudo-amino acid residues in accordance with tfie present' invention 
(where X is a halogen (I, Br, CI, F, with I and Br preferred for most, and F preferred 
for 9 attachment): 

Basic Amino Acids: 

Lys (no extra atoms) 
15 -CH2-SH + X-CH2-CH2-NH-PG, followed by deprotection 

gives -CH2-S-CH2-CH2-NH2 

Arg (no extra atoms) 

-CH2-SH + X-CH2-NH-C((NH2)(=NH2+)) 
gives -CH2-S-CH2-NH-C((NH2)(-NH2+)) 

20 His (2 extra atoms) 

-CH2-SH + X'CHi'IM'PG, followed by deprotection gives -CH2-S-CH2-iM 
Acidic Amino Adds: 
Asp (2 extra atoms) 

-CH2-SH + X-CH2-COOH gives -CH2-S-CH2-COOH 
25 Glu (1 extra atoms) 

.-CH2-SH + X-CH2-COOH gives -CH2-S-CH2-COOH 
Uncharged Polar Amino Acids: 
Tyr (no or 1 extra atom) 
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-CH2-SH + F-q)i)OH gives -CH2-S-(i)-pOH)(no extra atoms, same geometry 
-CH2-SH + Br/I-CH2-(p-pOH gives -CH2-S-CH2-9-POH (1 (Sftraatom) 

Gin (1 extra atom) 

-CH2-SH + X-CH2-C(0)(NH2) gives -CH2-S-CH2-C(0)(NH2) 
5 Asa (2 extra atoms) 

-CH2-SH + X-CH2-C(0)(NH2) gives -CH2-S-CH2-C(OXNH2) 
Ser (2 or 3 extra atoms) 

-CH2-SH + X-CH2OH) gives -CH2-S-CH2OH (2 extra atoms) 
-CH2-SH + X-CH2-CH2OH) gives .CH2-S-CH2-CH1OH (3 extra atoms) 

10 Tiir (2 or 3 extra atoms, missing beta branching) 

-CH2-SH + X-CH((CH3)(0-PG)) followed by removal of PG 
gives -CH2-S-CH((CH3)(OH)) (2 extra atom) . 
-CHrSH + X-CH2-CH((CH3)(0-/'(?)) followed by removal of PG 
gives -CH2-S-CH2-CH((CH3)(OH)) (3 extra atoms) 

IS Non-Polar Amino Adds: 

Leu (1 or 2 extra atoms) 

.CH2-SH + X-CH((CH3)(CH3)) gives -CH2-S-CH((CH3)(CH3)) (1 extra 
atom) , 

-CH2-SH + X-CH2-CH((CH3)(CH3)) gives -CH2-S-eH2-CH((CH3)(CH3)) (2 
20 extra atoms) 

He (2 or 3 extra atoms, missing beta branching) 

-CH2-SH + X-CH(CH3)-CH2-CH3 gives -CH2-S-CH(CH3)-CH2-CH3 (2 extra 
atoms) 

-CH2-SH + X-CH2-CH(CH3)-CH2-CH3 gives -CH2-S-CH2-CH(CH3)-CH2- 
25 CH3 (3 extra atoms) 

Phe (I or 2 extra atoms) 

-CH2-SH + F-(p gives -CH2-S-<p (1 extra atom) 

-CH2-SH + Br/I-CH2-(p gives -CH2-S-CHr(p (2 extra atoms) 

Met (no extra atoms) 
30 -CH2-SH + I-CH2-CH3 gives -CH2-S-CH2-CH3 

Trp(l extra atom) 
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-CH2-SH + F-iAT gives 

The general pseudo chemical ligation reaction is illustrated in Figure 1. 
Preferred Raa and X groups that can be used to form preferred pseudo-amino acids 
are shown in Table I. Residues are shown as uncharged, however, it will be 

5 appreciated that ionizable groups such as NH2 or COOH may be in their charged 
form, or provided as salts. All reactions except that used to form pseudo-lysine are 
aqueous and are conducted at a pH of 8-9, where the stocheometry and optimization 
of the reactions can be adjusted following standard protocols. Pseudo-lysine is 
formed in an aqueous reaction, conducted at pH 8-9, followed by reaction in 

10 piperidine/HiO (trifluoracetic acid (TFA)). 



Table I 


Raa -X Reagent 


Product of Reaction 


Pseado- 
Amino 
Acid 


Mimics 

or 
Replaces 


X-CH2COOH 

(X=I,Br). 


Cys 

^S-CHaCOOH 


Pseudo-Asp 
(+2 atoms) 

Pseudo-Glu 

(+1 atom) 


Aspartic 
Acid 
Glxitamic 
Acid 


X- 

(CH2)2COOH 
(X=I.Br) 


Cys 

^S— (CH2)2CO0H 


PseudorGlu 
(+2 atoms) 


Glutamic 
Acid 


X-(CH2)- 
-C(0)NH2 

(X=I,Br) 


0^ 

0 


Pseudo-Gin 
(+1 atom) 
Pseudo-Asn 
(+2 atoms) 


Glutamine 
Asparagine 


X-CH2OH 
(X=I,Br) 




ST 


Pseudo-Ser 
(+2 atoms) 


Serine 


X-CHOHCH3 
(X=I.Br) 


— cys 

I CHOH 


Pseudo-Thr 
(+2 atoms) 


Threonine 


X- 

CH2CHOHCH3 
(X-I,Br) 


cys 


Pseudo-Thr 
(+3 atoms) 


Threonine 
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Table I 


Sfla -X Reagent 


Prodnct of Reacdon 


Fseudo- 
Amino 
Acid 


Mimics 

or 
Replaces 


X-CH2CH2OH 
(X«I,Br) 


Ojs 

^S^CHzOH 


Pseudo-Ser 
(+3 atoms) 


Serine 


X-(CH2)2NH- 
PG 

(X=I.Br) 


— cys — 


Pseudo-Lys 


Lysine 


X-CH2NH- 
-C(NH2)2 

(X=I,Br) 


Cys- 


Pseudo-Arg 


Arginine 


X-(CH2)2NH- 
-C(NH2)2 

(X=I,Br) 


NH2 


Pseudo-Arg 
(+1 atom) 


A - - * • 

Argimne 


X-CH2CH3 
(X=I,Br) 


S""CH2CH3 


Pseudo-Met 


Methionine 


X-9 


Cys 


Pseudo-Phe 
(+1 atoms) 


Phenyl-, 
alanine 


X-CH2<p 
(X=I.Br) 


1 


Pseudo-Phe 
(+2 atoms) 


Phenyl- 
alanine 


X-{p-OH 
(para) 

(X= I, Br) 




Pseudo-Tyr 


Tyrosine 


1 X-CH29-OH 
0)ara) 

(X=I.Br) 




Pseudo-Tyr 
(+1 atom) 


Tyrosine 


X-CH2<i)-OPQ3 
(para) 

1 (X=I,Br) 


^OPO, 


Pseudo- 
Phospho-tyr 


Phospho- 
tyrosine 
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Table I 


Raa-X Reagent 


Product of Reaction 


Pseudo- 
Amino 
Acid 


Mimics 

or 
Replaces 


X-CH2(p-(OH)2 
(X=I.Br) 


Cys 

1 


Pseudo- 


Dopamine 


X-CH(C00H)2 
(X=I.Br) 


Cys 

1 ^COOH 

^S-CH 

COOH 


Pseudo-Gla 


y-carboxy 
Glutamic 
Acid 


X-Ca2-IM-PG 
(X=I.Br) 


Cys 1^ 

S-CHa— NH 


Pseudo-His 


Histidine 


X-CH2-IN 
(X=F.I.Br) 


Cys 

H 


Pseudo-Trp 


Tryptophan 


X-CH- 

-(CH3)2 

(X=I,Br) 


— CH3 

S — CH 


Pseudo-Leu 
(+1 atoms) 
Pseudo-Val 
(+2 atoms) 


Leucine | 
Valine 1 


X-CHjCH- 

-(CH3)2 

(X=I,Br) 


Cys 

^S-CH2-CH 


Pseudo-Leu 
(+2 atoms) 


Leucine 


X-CH(CH3)- 
-CH2-CHj 

(X=I,Br) 


1 r-cH3 

S-CH 

CH3 


Pseudo-Ile 
(+2 atoms) 


Isoleucine 


X-CH2- 
-CH(CH3> 
-CH2-CH3 

i:X=i:Br) • 


Cys 

S CH 

CH3 


Pseudo-Ile 
(+3 atoms) 


Isoleucine 


X-(CH2)2NH- 
-C(0)CF3 

(X=I,Br) 


0 

^S-(CH2)2-NH-C-CF3 


Pseudo-N- 
(ethyl) 
trifluoroacet 
-amide 


N-(ethyl) 

trifluoroacet 

-amide 
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Table T "1 


Rfia-X Reagent 


Product of Reaction 


Pseiidn- 

Amino 
Acid 


jyumics 

or 
Replaces 


X-ammonium 

4-chloro-7- 

sulfobenzo- 

furazan 

(X-SBF) 

(X=I,Br) 


0 


Pseudo-SBF 


Ammonium 

4-chloro-7- 

sulfobenzo- 

fiirazan 

(SBF) 

(Pierce I 
Chemical | 
Company) 


X.CH2- 

ammonium 4- 

chloro-7- 

sulfobenzo- 

furazan 

(X-SBF) 

(X=I,Br) 


Cys 

n1^> 


Pseudo 
methyl-SBF 


Ammonium 

4-chloro-7- 

suifi)benzo- 

furazan 

(SBF) 

(Pierce 

Chemical 

Company) . 



In a further embodiment of the present invention, the thiol group of the 
cysteine residue can be oxidized (-SH -> -SO3O. or replaced with oxygen to fonn 
serine (-SH ^ -OH). Such replacement can be accomplished by reacting the 
cysteine with tosyl-Cl (Tos-Cl) thereby converting the thiol group, -SH, into an -S- 
Tos group. In the presence of strong base, OH replaces the Tos substituent, thereby 
forming die desired -OH group. 

Additionally, the thiol group of the cysteine can be modified through a 
Michael addition reaction, as shown below: 
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where R is H, alkyl, aiyl, alkylaiyl, amino, or is a polymer molecule, which 
may be brancehed or unbranched, random or block. 

A further aspect of tfie present mvention is directed to ttie use of a peptide 
fragment whose amino terminal cysteine residue has been modified to contain an 
5 alkylamino (and preferably a methylamino) moiety: 

^^^Zs^C — Peptide A 

HS-^ 

- NH2 

By protecting the amino group of the alkylamino moiety, one can ligate the 
PeptideA to a second peptide having the general formula: Peptide B-aCOSR. Upon 
removal of the protecting group, an additional Peptide having the general formula: 
10 Peptide C-.aCOSR can be ligated to the previously ligated peptides, to form a 
branched peptide complex: 
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0 

Peptide B -COSR ^ '^"^ v^C~Peptide A 



O 



I NH(protected) 



II ° 
Peptide B -C. NH J|_pepOde A 

NH(protected) 

O ^ « 

II 

Peptide B -C NH |l_pep«deA 

+ HS-\ 
PqjtideC-COSR |{|h, 



O 

II ° 
Peptides -C NH^il^peptideA 



NH 

Peptide C —C=0 

Since the branched protein still contains the thiol group of the q^steme, it 
will react with an R^-X molecule in the manner described above, to thereby pemit 
further modifications of the side chain. In particular, by using an allq^lamide as the 
5 R substituent, an additional amino group can be introduced mto the molecule, 
thereby permittmg further branching, or further ligation reactions: 

o 

II H O 

Peptide B-C—NH^S-Peptlde A || ^ 



-^^-f-ep^^e^ Peptide B^^_NH^, 

+ HS>^ ^ 

I-CH2CONH2 NH ^^^^0 

O 



C — Peptide A 



Peptide C— C=0 



NH 

0=.c— Peptide C 

In a further embodiment of the present invention, the modification of the 
cysteine side cham can be used to introduce any of a variety of labels or binding 
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molecules. For example, labels detectable by electron spin resonance, fluorescence 
(e.g., ammonium 4-chIoro-7-sulfobenzo-furazan (SBF) Pierce Chemical CoOor 
magnetic imaging may be employed. Such labeling is useful in the diagnosis of 
diseases and conditions, as well as in the analysis of molecular interactions (e.g., for 
5 measuring distances in mtegral m«nbrane proteins). The methodology of the 

present invention has several salient advantages with respect to the use of labels and 
binding molecules. Such substituents can be incorporated in a site specific,, non- 
random manner. Additionally, the label moiety need only be stable to subsequent 
chemical ligation steps (if any). Such reactions are generally conducted under veiy 

10 mild conditions (such as those employed in native chemical ligation; removal of 
Acm or otfier protecting groups that may optionally be used to protect the thiol 
groups of other cysteine residues (as with Hg(CH2COOH)2); and reverse phase 
HPLC purification). Protection of such additional thiol groups prevents oxidative 
polymerization of Cys-containing peptides and thus facilitates their purification and 

15 handling. 

B. Extended Native Chemical Ligation 

As indicated above, where the sequence of a natural protein does not contain 
a suitable cysteine residue, and it is either inconvenient or undesirable to modify a 
natural protein sequence so as to introduce a cysteine residue at the N-terminus of a 

20 polypeptide, the method of 'Extended Native Chemical Ligation" may be used to 
ligate polypeptides whose N-terminus has been modified to contain an N-substituted, 
and preferably, Na-substituted, 2 or 3 carbon chain amino allqrl or aryl thiol, and 
thereby permit the principles of native chemical ligation to be employed with 
polypeptides lacking cysteine residues (see, U.S. Patent Application Serial No. 

25 60/23 1 ,339, herein mcorporated by reference). 

The method of 'Extended Native Chemical Ligation" involves ligating a first 
component comprising a carboxyl ftioester, and more preferably, an a-carbo;^I 
thioester with a second component comprising an acid stable N-substituted, and 
preferably, Na-substituted, 2 or 3 carbon chain ammo alkyl or aryl thiol. 

30 Chemoselective reaction between die carbojqlhioester of the first component and the 
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25 



thiol Of the N-sabstituted 2 or 3 carbon chain" alkyl or aiyl thiol of the second 
component proceeds through a thioester-linked intemediate, and resolves into an 
initial ligation product More specifically, the thiol exchange occurring.between the 
COSR thioester component and the amino alkyl thiol component generates a 
5 thioester-linked intermediate Ugation product that after spontaneous rearrangement 
generates an amide-linked first ligation product through a 5-membered or 6- 
membered ring intermediate depending upon whether the amino alkyl thiol 
component has formula I or II, respectively: 

J1-C(0)-N(C1(R1)-C2-SH)-J2 I 
10 J1-C(0)-N(C1(R1)-C2(R2)-C3(R3)-SH)-J2 n 

where Jl is a peptide or polypeptide having one or more optionally protected 
amino acid side chains, or a moiety of such peptide or polypeptide, a polymer, a dye, 
a suitably functionalized surface, a linker or detectable marker, or any other chemical 
moiety compatible. with chemical peptide synthesis or extended native chemical 
ligation; Rl, R2 and R3 are independently H or an electron donating group 
conjugated to CI; with the proviso that at least one of Rl, R2 and R3 comprises an 
electron donating group conjugated to CI; and J2 is a peptide or polypeptide having 
one or more optionally protected amino acid side chains, ora moiety of such peptide 
or polypeptide, a polymer, a dye, a suitably fimctionalized surfece, a Imker or 
detectable maricer; or any other chemical moiely compatible with chemical peptide 
synthesis or extended native chemical ligation. 

The N-substituted 2 or 3 carbon chain alkyl or aiyl thiol [HS-C2-C1(R1)-] or 
[HS-(C3(R3>C2(R2)-C1(R1H at the ligation site is amenable to being removed, 
under peptid&^sompatible conditions, without damage to the product, to generate a 
final ligation product of formula m, having a native amide bond at tiie ligation site: 

ji-c(o>HN-j2 m 

where Jl , J2, Rl , R2. and R3 are as defined above. 
The Rl, R2 and R3 groups are selected to facilitate cleavage of the N-Cl • 
bond under peptide compatible cleavage conditions. For example, electron donating 



15 



20 
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groups, particularly if conjugated to CI, can be used to ferm .a resonance stabilized 
cation at CI that facilitates cleavage. The chemical ligation reaction preferably 
includes as an excipient a fhiol catalyst, and is carried out around neutral pH 
conditions in aqueous or mixed organic-aqueous conditions. Chemical ligation of 
5 the first and second components may proceed through a five or six member ring that 
undergoes spontaneous rearrangement to yield an N-substituted amide linked 
ligation product. Where the first and second components are peptides or 
polypeptides, the N-substituted amide linked ligation product has formula IV or V: 

Jl-C(0)-Na(Cl(Rl)-C2-HS)-CH(Z2>C(0)J2 IV 
10 Jl-C(0)-Na(Cl(Rl>C2(R2)-C3(R3>HS)-CH(Z2)-C(0)-J2 V 

where Jl, J2 and Rl, R2, R3 are as defmed above and Z2 is an amino acid 
side chain or a d^ivative of such side chain.. 

The conjugated electron donating groups Rl, R2 or R3 of the N-substituted 
amide bonded ligation product facilitate cleavage of the N-Cl bond and removal of 
15 the 2 or 3 carbon chain alkyl or aryl thiol from the N-substituted amide-linked 
ligation product Removal of the alkyl or aryl thiol cham of the N under peptide- 
compatible cleavage conditions generates a ligation product having a native amide 
bond at the ligation site. Where the first and second components are peptides or 
polypeptides, the ligation product will have the formula: 



20 



Jl-CONaH-CH(Z2)-C(0)-J2 
Exemplary Rl subtituents for Formula I are depicted m Table n. 



X 
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m. 



Exemplary Rl, R2 and R3 substituents for Formula II are depicted in Table 
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Table m 



Formula II 




HNa CH — C J2 



R3 R2 



RI, R2 and R3 Substituents (CI included for reference) • 




As with the N-substituted 2 carbon chain compounds, positioning of the 
benayl and picolyl electron-donating substituents RI', R3' and R5' in the ortho or 
para positions is necessaiy to maintain electronic conjugation to the CI carbon for 

S robust cleavage of the Na-Cl bond following ligation. However, when R2 and R3 
form a benzyl group with C2 and C3, at least one of Rr and R3' comprises a strong 
electron donating group, where RI J or R3* is selected from methoxy (-0CH3), thiol 
(-SH), hydroxyl (-0H), and thiomethyl (-SCH3). For the N-substituted 3 carbon 
cham thiols in which R2 and R3 are hydrogens, RI comprises a benzyl or picolyl 

10 group in which Rl% R3' and RS' include either strong or moderate electron-donating 
groups, or a combination thereof As with the N-substituted 2 carbon chain alkyl or 
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aryl thiols, the strong electron-donating groups enhance the sensitivity of the 3 
carbon chain alkyl or aryl thiol, to cleavage following ligation. Thus a particular 
electron-donating group or combination thereof can be selected accordingly. 

Similar to the N-substituted 2 carbon chain compounds, the N-substituted 3 

5 carbon chain compounds of the present invention may include a thiol as a substituent 
of Rl in the Rr and R5' positions when available for substitution in a construct of 
interest. Here again the electron-donating thiol group is conjugated to CI and its 
introduction at these locations enables the compounds to have two routes for the 6- 
member ring forming ligation event. It also increases the local concentration of 

10 available thiols for reacting with the a-carboxy thioester, and provides for additional 
conformations in terras of structural constraiiits that can improve ligation. 

Synthesis of the N-terminal N-substituted 2 or 3 carbon cham alkyl or aryl 
thiol ammo acids of the mvention can carried out as described herein, for example, 
in Scheme I and Scheme n, and in accordance with standard organic chemistry 

15 techniques known in the art. See, e,g., "Advanced Organic Chemistry, Reactions, 
Mechanisms, and Structure," 4* Edition, J. March (Ed.), John Wiley & Sons, New 
York, NY, 1992; "Comprehensive Organic Transformations, A Guide to Functional 
Group Preparations," R. Larock (Ed.), VCH Publishers, New York, NY, 1989. They 
may be synthesized in solution, by polymer-supported synthesis, or a combination 

20 thereof. The preferred approach employs N alpha protected N alkylated S-protected 
amino alkyl- or aryl- thiol amino acid precursors. The reagents utilized for synthesis 
can be obtained from any number of commercial sources. Also, it will be well 
understood that the starting components and various intermediates, such as the 
mdividual ammo acid derivatives can be stored for later use, provided in kits and the 

25 like. 

In preparing the N-termmal Na-substituted 2 or 3 carbon chain alkyl or aryl 
tiiiol amino acids of the invention, protecting group strategies are employed. The 
preferred protecting groups (PG) utilized m the various synthesis strategies in 
general are compatible with Solid Phase Peptide Synthesis ("SPPS"). In some 
30 instances, it also is necessary to utilize orthogonal protecting groups that are 
removable under different conditions. Many such protecting groups are known and 
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suitable for tfiis purpose (See, e.g., *Trotecting Groups in Organic Synthesis", 3rd 
Edition, T.W. Greene and P.GM. Wuts, Eds., John Wiley & Sons, Inc., 1999; 
NovaBiochem Catalog 2000; "Synthetic Peptides, A User's Guide," GA. Grant, Ed., 
W.H. Freeman & Company, New York, NY,1992; "Advanced Chemtech Handbook 

5 of Combinatorial & Solid Phase Organic Chemistry," WD.. Bennet, J.W. 
Christensen, L.K. Hamaker, M.L. Peterson, M.R.Rhodes, and H.H. Saneii, Eds., 
Advanced Chemtech, 1998; "Priciples of Peptide Synthesis, 2nd ed.," M. 
Bodanszky, Ed., SpringerrVerlag, 1993; "The Practice of Peptide Synthesis, 2nd 
ed.," M. Bodanszky and A. Bodanszky, Eds., Springer-Verlag, 1994; and "Protecting 

10 Groups," P.J. Kocienski, Ed., Georg Thieme Verlag, Stuttgart, Germany, 1994). 
Examples include benzyloxycarbonyl (Z), Boc, Bpoc, Trt, Nps, FmocCl-Z, Br-Z; 
NSC; MSC, Dde, etc. For sulfiir moieties, examples of suitable protecting groups 
include, but are not limited to, benzyl, 4-methylbenzyl, 4-methoxybenzyl, trityl, 
ACM, TACAM, xanthyl, disulfide derivatives, picolyl, and phenacyl. 

15 More particularly, the Na-substituted 2 or 3 carbon chain alkyl or aryl thiols 

can be prepared in accordance with Scheme I (Solid-Phase preparation of the Na- 
substituted precursor). Scheme n (Solution-Phase preparation of the Na-substituted 
precursor ). In Scheme I, Na-substituted 2 or 3 carbon chain alkyl or aryl thiols are 
assembled directly on the solid phase using standard methods of polymer-supported 

20 organic synthesis, while the Na-protected, N-alkylated, S-protected, aminoalkyl or 
arylthiol amino acid precursor of Scheme II are coupled to the resin using standard 
coupling protocols. Where racemic or diastereomeric products are produced, it may 
be necessary to separate tfiese by standard methods before use in extended native 
chemical ligation. 
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Scheme I 



Solid-Phase Synfliesis Of ECLAuxilliaryDeriviatized Molecules 





A 



R2 



OR 




OHC ^J2-resin 




NH 



^Jz-resin 




R=H 

X = halogen 
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TEe a-carboxythioesters can be generated by chemical or biological meAods 
following standard techniques known in the art, such as those described herein, 
including the Examples. For chemical synthesis, a-carboxythioester peptides can be 
synthesized m solution or from thioester-generating resins, which techniques are 
5 well known (See, e.g., Dawson et al., supra; Canne et aL, supra; Hackeng et aL, 
supra, Aimoto et aL), For instance, chemically synthesized thioester peptides can be 
made from die corresponding peptide a-thioacids, which in turn, can be synthesized 
on a thioester-resm or in solution, although the resin approach is preferred. The 
peptide-a-thioacids can be converted to the conresponding 3-carboxy-4-nitrophenyl 
10 thioesters, to tfie corresponding benzyl ester, or to any of a variety of alkyl thioesters. 
All of these thioesters provide satisfactory leaving groups for die ligation reactions. 
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with the 3-carboxy-4-nitrophenyI thioesters demonstrating a somewhat faster 
reaction rate than the corresponding benzyl thioesters, which in turn may be more 
reactive than the alkyl thioesters. As another example, a trityi-associated 
mercaptoproprionic acid leucine thioester-generating resin • can be utilized for 

5 constructing C-terminal thioesters (Hackeng et al, supra). C-terminal thioester 
synthesis also can be accomplished using a 3-carboxypropanesuIfonamide safety- 
catch Imker by activation with diazomethane or iodoacetonitrile followed by 
displacement with a suitable thiol (Ingenito et al,, supra; Bertozzi et aL). 

Ligation of the N-substituted 2 or 3 carbon chain alkyl or aryl thiol 

10 components , of the invention with the first carboxythioester component generates a 
ligation product having an N-substituted amide bond at the ligation site. The 
ligation conditions of the reaction are chosen to maintain the selective reactivity of 
the thioester with the N-substituted 2 or 3 carbon chain alkyl or aryl thiol moiety. In 
a preferred embodiment, the ligation reaction is carried out in a buffer solution 

15 haying pH 6-8, with the preferred pH range being 6.5-7.5. The buffer solution may 
be aqueous, organic or a mixture thereof. The ligation reaction also may include one 
or more catalysts and/or one or more reducmg agents, lipids, detergents, other 
denaturants or solubilizmg reagents and the like. Examples of prefened catalysts are 
thiol and phosphine containing moieties, such as thiophenol, benzyhnercaptan, 

20 TCEP and alkyl phosphines. Examples of denaturing and/or solubilizing agents 
include guanidinium, urea in water or organic solvents such as TFE, HFIP, DMF,' 
NMP, acetonitrile admbced with water, or with guanidinium and urea in Water. The 
temperature also may be utilized to regulate the rate of the ligation reaction, which is 
usually between 5°C and 55°C, with the preferred temperature being between IS'^C 

25 and 40'*C. As an example, the ligation reactions proceed well in a reaction system 
having 2% thiophenol in 6M guanidinium at a pH between 6.8 and 7,8. 

For the N-substituted 2 carbon chain alkyl or aryl thiols, the ligation event 
results from a thiol exchange that occurs between the COSR thioester component 
and the amino alkyl thiol component. The exchange generates a thioester-linked 

30 intermediate ligation product that after spontaneous rearrangement through a 5- 
membered ring intermediate generates a first ligation product of the formula Jl-HN- 
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CH(Zl)-C(0)-Na(Cl(Rl>C2-SH)-CH(Z2)-J2 having a removable N-substituted 2 
carbon chain allsyl or aryl thiol [HS-C2-C1(R1)-] at the ligation site, where the 
substituents are as defined above. The N-substituted 2 carbon chdn alkyl or aryl 
thiol [HS-C2-C1(R1)-] at the ligation site is amenable to being removed, under 

5 peptide-compatible conditions, to generate a final ligation product of the formula Jl- 
HN-CH(Zl)-CO-NH-CH(Z2)-CO-J2 having a native amide bond at the ligation site. 

For the N-substituted 3 carbon chain aryl or alkyl thiols, the thiol exchange 
between the COSR thioester component and the amino alkyl thiol component 
generates a thioester-linked intermediate ligation product that after spontaneous 

10 rearrangement through a 6-membered ring intermediate generates a first ligation 
product of the foraiula Jl-HN-CH(Zl)-C(0>Na(Cl<:2(R2)-C3(R3>SH)<:H(Z2)- 
J2 having a removable N-substituted 3 carbon chain alkyl or aryl thiol [HS-C3(R3)- 
C2(R2)-C1(R1)-] at the ligation site. The N-substituted 3 carbon chain aryl thiol 
[HS-C3(R3)-C2(R2)-C1(R1)-] at the ligation site is amenable to being removed, 

15 under peptide-compatible conditions, to generate a final ligation product of the 
formula Jl-HN-CH(Zl)-CO-NH-CH(Z2)-CO-J2 having a native amide bond at the 
ligation site. 

Removal of the N-substituted alkyl or aryl thiol group is preferably 
perfbrmed in acidic conditions to &cilitate cleavage of die N-Cl bond, jnielding a 

20 stabilized, unsubstituted amide bond at the ligation site. By '"peptide-compatible 
cleavage conditions" is intended physical-chemical "conditions compatible with 
peptides and suitable for cleavage of the alkyl or aryl thiol moiety from the ligation 
product I'eptide-compatible cleavage conditions in general are selected depending 
on die a-substituted compound employed, whidi can be readily deduced through 

25 routine and well known approaches (See, e.g., "Protecting Groups in Organic' 
Synthesis", 3rd Edition, T.W. Greene and P.GJM. Wuts, Eds., John Wiley & Sons, 
Inc., 1999; NovaBiochem Catalog 2000; "Synthetic Peptides, A User's Guide;" GA. 
Grant, Ed., WJi. Freeman & Company, New York, NY,1992; "Advanced Chemtech 
Handbook of Combmatorial & Solid Phase Organic Chemistry," WD.. Bennet, J.W. 

30 Christensen, L.K. Hamaker, M.L. Peterson, M.R.Rhodes, and HJi. Saneii, Eds., 
Advanced Chemtech, 1998; *Triciples of Peptide Synthesis, 2nd ed.," M. 
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Bodanszky, Ed., Springer-Verlag, 1993; "The Practice of Peptide Synthesis, 2nd 
ed.," M. Bodanszky and A. Bodanszky, Eds., Springer-Verlag, 1994; and 'Trotecting 
Groups," PJ. Kocienski, Ed., Georg Thieme Verlag, Stuttgart, Germany, 1994). 

For example, where flie Rl, R2 or R3 substituents comprises a methoxy, 

5 hydroxyl, thiol or thiometiiyl, methyl and the like, the more universal method for 
removal involves acidic cleavage conditions typical for peptide synthesis 
chemistries. This includes cleavage of the N-Cl bond under strong acidic conditions 
or water-acidic conditions, with or without reducing reagents and/or scavenger 
systems (e.g., acid such as anhydrous hydrogen fluoride (HF), triflouroacetic acid 

10 (TFA), or trimethylsulfonyl flouroacetic acid (TMSFA) and the like). More specific 
acidic cleavage systems can be chosen to optimize cleavage of the Na-Cl bond to 
remove the aryl or alkyl thiol moiety for a given construct. Such conditions are well 
known and compatible with maintaining the integrity of peptides. A thiol scavenger 
may be included, particularly where tryptophans are present in a peptide or 

15 polypeptide sequence to avoid reaction of the tryptophan side chain with the 
liberated aryl or alkyl thiol moiety. Examples of thiol scavengers include ethanediol, 
cysteine, beta-mercaptoethanol and thiocresol. 

Other specialized cleavage conditions include light or reductive-cleavage 
conditions when the picolyl group is the substituent. As an example, when the Rl, 

20 or R2 and R3 substituents comprise a picolyl nioiety, photolysis (e.g., ultraviolet 
light), zinc/acetic acid or electrolytic reduction may be used for cleavage following 
standard protocols. Where Rl of the N-substituted 2 carbon cham thiol comprises a 
thiomethane at Rl, the mercury or HF cleavages can be used. The cleavage system 
also can be used for simultaneous cleavage from a solid support and/or as a 

25 deprotection reagent when the first or second ligation components comprise other 
protecting groups. 

In one embodiment of the present mvention Na-substituted 2 or 3 chain alkyl 
or aryl thiols are employed in the peptide synthesis step (particularly in automated 
peptide synthesis and orthogonal and convergent ligation strategies) to yield properly 
30 N-termmally derivatized polypeptides that can be used as substrates for extended 
chemical ligation to yield synthetic bioactive proteins. Such compounds comprise a 
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fiilly protected, partially protected or fiilly unprotected acid stable Na-substituted 2 
or 3 carbon chain amino alkyl or aiyl thiol of the formula (PG1)S-C2-C1(R1)- 
Na(PG2)-CH(Z2)-C(0)-J2 or (PGl)S-C3(IG)-C2(R2)-Cl(Rl)^Na(PG2)-CH(Z2)- 
C(0)-J2, which are depicted below in Table IV and Table V. In particular, one or 

5 more of Rl, R2 and R3 comprises an electron donating group conjugated to CI that, 
following conversion of the Na-substituted amino allcyl or aryl thiol to an Na- 
substituted amide alkyl or aryl thiol, is capable of forming a resonance stabilized 
cation at CI that facilitates cleavage of the Na-Cl bond under peptide compatible 
cleavage conditions. PGl and PG2 are protecting groups that are present 

10 individually or in combination or are absent and can be the same or different, where 
Z2 is any chemical moiety compatible with chemical peptide synthesis or extended 
native chemical ligation, and where J2 is any chemical moiety compatible with 
chemical peptide synthesis or extended native chemical ligation. PGl (or XI) is a 
group for protecting flie amine. PG2 (or X2) is a group for protecting the thiol. 

15 Many such protecting groups are known and suitable for this purpose (See, e.g., 
*Trotecting Groups in Organic Synthesis", 3rd Edition, T.W. Greene and P.GM. 
Wuts, Eds., John Wiley & Sons, Inc., 1999; NovaBiochem Catalog 2000; "Synthetic 
Peptides, A User's Guide,'* G A. Grant, Ed., W.H. Freeman & Company, New York, 
NY,1992; "Advanced Chemtech Handbook of Combinatorial & Solid Phase Organic 

20 Chemistry," W.D.. Bennet, J.W. Christensen, L.K. Hamaker, M.L. Peterson, 
M.R.Rhodes, and H.H. Saneii, Eds., Advanced Chemtech, 1998; 'Triciples of 
Peptide Synthesis, 2nd ed.," M. Bodanszky, Ed., Springer-Verlag, 1993; 'The 
Practice of Peptide Synthesis, 2nd ed.," M. Bodanszky and A. Bodahszlg', Eds., 
Springer-Verlag, 1994; and *Trotecting Groups," P. J. Kocienski, Ed., Georg Thieme 

25 Verlag, Stuttgart, Germany, 1994). 
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TablelV 
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Examples of preferred protecting groups for PGI and XI include, but are not 
limited to [Boc(t-Butylcarbamate), Troc(2,2,2,-Trichloroethylcarbamate), Fmoc(9- 
Fluorenylmethylcarbamate), Br-Z or Cl-Z(Br- or Cl-Benzylcarbamate), Dde(4,4,- 
dimethyl-2,6-dioxocycloexl-ylidene), MsZ(4-Methylsulfinylben2ylcarbamate), Msc 

5 (2-Methylsulfoethylcarbamate) Nsc(4-nitrophenylethylsulfony]-ethyloxy-carbonyl]. 
Preferred PGI and XI protecting groups are selected from "Protective Groups in 
Organic Synthesis " Green and Wuts, Third Edition, Wiley-hterscience, (1999) with 
the most preferred being Fmoc and Nsc. Examples of preferred protecting groups 
for PG2 mclude, but are not limited to [Acm(acetamidomethyl), MeoBzl or Mob(p- 

10 Methoxybenzyl), Meb(p-Methylbenzyl), Trt(Trityl),Xan(Xanthenyl),tButhio(s-t- 
butyl),Mmt(p-Methoxytrityl),2 or 4 Picolyl(2 or 4 pyridyl)),Fm(9-Fluorenyl- 
methyl),tbut(t-Butyl),Tacam(Trimethylacetamidomethyl)] . Preferred protecting 
groups PG2 and X2 are selected from "Protective Groups in Organic Synthesis," 
Green and Wuts, Third Edition, Wiley-Interscience, (1999), with the most preferred 

15 being Acm, Mob, MeB, Picolyl . 
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PGi Na — CH — C- 



HS, 



R3 R2 



Table V 
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The protected forms of the Na-substituted 2 or 3 chain alkyl or aryl thiols of 
the invention can be prepared as in Schemes I and n above. 

The compounds of the present invention may be produced by any of a variety 
of means, including halogen-mediated amino edkylation, reductive amination, and by 
the preparation of Na-protected, N-alkylated, S-protected, amino alkyl- or aryl- thiol 
amino acid precursors compatible with solid phase amino acid synthesis methods. 
When desirable, resolution of the racemates or diastereomers produced to give 
compounds of acceptable chiral purity can be carried out by standard methods. 



n. The Bioactive Peptides and Proteins of the Present Invention 

10 A. Synthetic Bioactive Proteins 

The above-described methods can be used to synthesize synthetic bioactive 
peptides and proteins. As used herein, a bioactive protein is said to be "synthetic" if 
non-recombinant technology has been employed to polymerize some, and most 
preferably all, of its amino acid residues. The term "non-recombinant technolog/' is 
IS intended to distinguish technologies that involve organic chemistry and other 
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synthetic polymerization approaches from technologies involving the translation of 
RNA into protein, either in vivo or in vitro. Most preferably, the synthetic bioactive 
proteins of the present invention will be produced m vitro using the principles of 
organic chemistry, and in particular the methods of "native chemical ligation" and 

5 "extended native chemical ligation," as discussed herein. Such chemical synthesis 
will preferably be conducted using a convergent synthetic approach in which 
polypeptide fragments will be synthesized and then ligated to one another to form 
the synthetic bioactive protein. Alternatively, the synthetic bioactive proteins of the 
present invention may be synthesized in a single non-convergent synthesis. 

10 As indicated above, the synthetic bioactive proteins of the present mvention 

has substantial molecular weight, preferably being greater than about 25 IcDa. For 
the purposes of the present invention, such determinations of molecular weight are to 
be made by denaturing SDS polyacrylamide electrophoresis. The term "monomer 
molecular weight" is intended to refer to the molecular weight of a monomer 

15 synthetic protein, as distinguished from synthetic proteins that may possess muhiple 
copies of a protein or polypeptide. 

As used herein, a synthetic protein is said to have "biological activity" if it 
possesses a discernible bioactivity that is dependent upon the synthetic protein's 
structure or amino acid sequence, such the variation in such structure or sequence 

20 enhances, modifies, or attenuates the protein's bioactivity. Without limitation, such 
"biological activity*' includes the ce^acity of the protein to mediate a catalytic, 
signaling or inducing reaction. As used herem, a protein is said to ''mediate a 
catalytic reaction" by converting a substrate into a product without itself being 
consumed or permanently modified. A protein is said to "mediate a signaling or 

25 inducing reaction" if its presence causes an organism, or its tissues or cells, to 
initiate, continue, enhance, modify, or attenuate gene expression, cellular 
differentiation, or cellular proliferation. Examples on such signaling or inducing 
reactions mclude inducing cytokme expression or a response to cytokine expression, 
inducing erythropoiesis, inducing or attenuating inflammation and/or an 

30 inflammatory process, initiating angiogenesis or vascularization, inducing apoptosis, 
afBecting the cell cycle, etc. 



wo 02/20034 



PCT/USOl/21935 



-45- 

The bioactivity of the synthetic bioactive proteins of the present invention 
also includes the capacity of the protein to bind specifically to a receptor, ligand or 
antibody. As used herein, the term "specific" binding is intended to refer to a 
structurally-based binding interaction, such as that existing between a honnone and 

S its receptor, or an antibody and an antigenic epitope, as distinguished f:om "non- 
specific" binding based upon charge, solubility, hydrophobicity, etc. 

The synthetic bioactive proteins of the present invention are preferably 
synthesized by the condensation of amino acid residues. Such amino acid residues 
may be the nucleic acid encoded, ribosomally installed amino acids: alanine, 

10 arginine, asparagine, aspartate, cysteine, glutamate, glutamine, glycine, histidine, 
isoleucine, leucine,, lysine, methionine, phenylalanine, proline, serine, threonine, 
tryptophan, tyrosine and valine, hi one embodiment, the amino acid sequence 
selected for a particular desired synthetic bioactive protein would be the native, or 
natural sequence of that protein, hi an alternative embodiment, the selected amino 

15 acid sequence will be composed of, all constructed from polypeptide fi-agments that 
may contain an N-terminal cysteine residue in place of the amino acid residues 
present in the native or natural sequence of that protein. The inclusion of such a 
residue permits the polypeptide to be ligated to another polypeptide (modified to 
contain a carboxy thioester group) using the principles of native chemical ligation 

20 described here and, and as such, facilitates the use of convergent synthesis to 
produce synthetic bioactive proteins. 

In a fiirther embodiment, the synthetic bioactive proteins of the present 
invention may contain "irregular" ammo acid residues. As used herein, the term 
"irregular amino acid residues is intended to refer to . amino acids that are not 

25 encoded by RNA and are not ribosomally installed. In this regard, the present 
invention permits wide selectability and flexibility in the design and/or construction 
of synthetic bioactive proteins. Examples of non-ribosomally installed amino acids 
that may be used in accordance with a present invention include: D-amino acids, B- 
amino acids, pseudo-gjutamate, y-aminobutyrate, ornithine, homocysteine, N- 

30 substituted ammo acids (EL Sunon et al, Proc. Natl. Acad. Sci. U.SA. (1992) 89: 
9367-71; WO 91/19735 (Bartlett et a/.), U.S. Patent 5,646,285 (Baindur), a- 
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aminomethyleneoxy acetic acids (an amino acid-GIy dipeptide isostere), and a- 
aminooxy acids, etc. Peptide analogs containing thioamide, vinylogous amide, 
hydrazine, methyleneoxy, thiomethylene, phosphonamides, oxyamide, 
hydroxyethylene, reduced amide and substituted reduced amide isosteres and 6- 

5 sulfonamide(s) may be employed. 

In particular, the use of pseudo-glutamate is advantageous since the R chain 
modification permits attachment of polymer adducts to the synthesized protem. 
Likewise, N-terminal Na-substituted 2 or 3 carbon chain alkyl or aryl thiol amino 
acids may be employed. Such residues (where present at the end terminus or 

10 polypeptide) can be advantageously used to ligate that polypeptide to a polypeptide 
havuig a carboxy thioester moiety, in accordance with the methods of extended 
native chemical ligation described herein. 

In one embodiment, all of the amino acid residues of the synthetic bioactive 
protein may be joined together by a peptide bond (i.e., an amide bond). 

15 Alternatively, two amino acid residues (or the C-terminal and N-terminal residues of 
two polypeptides) may be linked to one another by a non-amide bond (such as a 
thioester bond, an oxime bond, a thioether bond, a directed disulfide bond, a 
thiozolidine bond, etc.) (Schnolzer, M. and Kent, S.B.H., Science (1992) 256:221- 
225; Rose, K., J. Amer. Chem Soc. (1994) 116:30-33; Englebretsen, D.R. et al, 

20 Tetrahedron Lett. 36:8871-8874; Baca, M. et al, J- Amer. Chem Soc. (1*995) 
117:1881-1887; Liu, C.F. et al, J. Amer. Chem Soc. (1994) 116:4149-4153; Liu, 
C.F. et aU J. Amer. Chem Soc. (1996) 118:307-312; Dawson, P.E. et al (1994) 
Science 266:776-779). The invention thus permits a variety of peptide bond 
modifications, surrogates and isosteric replacements to be exploited in the 

25 preparation of bioactive protems. 

The synthetic bioactive proteins of the present invention may be designed to 
possess a bioactivity that mimics a bioactivily associated with a mammalian 
(including human, simian, bovine, murine, porcme, ovine, equine, etc.), avian, or 
piscine protein. As used herein, a first protein is said to mimic a second protein if 

30 the first protem possesses a bioactivity that is modified, enhanced, or attenuated with 
respect to the bioactivity of the second protein. A bioactivity is said to be 
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"associated" with the protem if its presence is directly or indirectly dependent upon 
the amino acid sequence of the protein. 

In alternative embodiment, the bioactive proteins of the present mvention 
may be wholly or partly engineered, without reference to any corresponding natural 
5 bioactive counterpart. Preferred synthetic bioactive proteins of the present mvention 
mclude receptors, protein receptor ligands. Examples of such proteins mclude 
adrenocorticotropic hormone receptor and its bioactive fragments, angiotensin 
receptor, atrial natriuretic receptor, bradykinmin receptor, growth hormone receptor, 
chemotatic receptor, dynorphin receptor, endorphin receptor, the receptor for P- 
10 lipotropin ancj its bioactive fragments, enkephalin receptor, enzyme mhibitor 
receptors, the receptor for fibronectin and its bioactive fragments, gastrointestinal- 
and growth hormone-releasing peptide receptors, the receptor for luteinizing 
hormone releasing peptide, the receptor for melanocyte stimulating hormone, 
neurotensin receptor, opioid receptor, oxytocin receptor, vasopressin receptor, 
15 vasotocin receptor, the receptor for parathyroid hormone and fragments, protein 
kinase receptor, somatostatin receptor, substance P receptor. 

The synthetic bioactive proteins of the present invention they also include 
enzymes, and structural molecules, such as chitin or collagen, etc. 

In a further embodiment, the synthetic bioactive proteins of the present 
20 invention include cytokines. Cytokines are small proteins or biological factors (in 
the range of 5-20 kDa) that are released by cells and mediate specific ejffects on cell- 
cell interaction, conununication and on the behavior of other cells. As used herein, 
the term "cytokme" is mtended to mclude the interleukins ("IL) (such as IL-1, IL-2, 
IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, DL-IO, DL-ll, IL-12, IL-13, IL14, etc.); 
25 lymphokines and signaling molecules such as eiythropoiesis stunulating proteins 
(e.g., erythropoietm (BPO)), tumor necrosis factor (TNF), mterferons, etc, growth 
factors, such as transformmg growth fector, nerve growth factor, bram derived 
growth fector, neurotrophin-3, neurotrophin-4, heptaoq^e growth factor, T-cell 
Growth Factor (TGF, TGF-pi, TGF-p2, TGF-P3, etc.), Colony Stunulating Factors 
30 (G-CSF, GM-CSF, M-CSF etc.). Epidermal Growth Factor (EGF, LIF, KGF, GSM, 
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PDGF, IGF-I, etc.). Fibroblast Growth Factor (oFGF, jJFGF, etc.), and hormones, 
particularly single peptide hormones, such as growth hormone). 

Cytokines, and particularly chemotactic cytokines, also comprise particularly 
preferred examples of proteins whose amino acid sequence can be used to aid in the 
5 design of synthetic bioactive proteins within the ambit of the present invention. 
Cytokines participate in innate and acquired immune responses by either directly or 
indirectly mediating leukocyte recruitment, or by serving as important signals in the 
activation of leukocytes that have accumulated at specific sites of microbial 
invasion. The pro-inflammatory cytokines, interleukin-1 6 (IL-1 B) and interleukin-6, 

10 (IL-6) arid mediators reflect neutrophil recruitment and activation, including soluble 
intercellular adhesion molecule, interleukin-8 (IL-8) (Kotecha S., European Journal 
of Pediatrics (1996) 155 Suppl 2:S14-17). Interleukin-6 is a multifunctional 
cytokine primarily involved in the acute-phase response, B cell differentiation, and 
antibody production. (Akira S, et ah Interleukin-6 in biology and medicine. Adv 

15 Immunol (1993) 54:1-78). This cytokine is synthesized and secreted by many 
diflFerent cells, including monocytes, macrophages, T and B cells, endothelial cells, 
and fibroblasts, IL-6 is often induced with the proinflammatory cytokines TNFalpha 
and IL-1 in many alarm conditions, and circulating IL-6 plays an important role in 
the induction of acute phase reactions. (Xing Z, et aL, J. Clin» Invest. (1998) 

20 101(2):311-20) Cytokines that directly or faidirectly influence both leukocyte 
recruitment and activation include TNF-alpha, granulocyte-colony-stimulating factor 
(G-CSF), granulocyte-macrophage-colony-stimulating factor (GM-CSF), and 
members of the chemokine femily. Cytokines that predominantly regulate the state 
of leukocyte activation include interferon-ganMna (IFN-gamma), interleukin-1 0 (JL- 

25 10), and interleukin-12 (IL-12). Cytokines can also influence microbial clearance by 
inducing or regulating the expression of other effector molecules, which can occur in 
either an autocrine or paracrine fashion. 

Colony-stimulatmg factors are cytokines produced by a variety of myeloid 
and stromal cells that are required for the proliferation and maturation of 

30 hematopoietic stem cells. Additionally, studies show that these factors augment the 
effector cell activities of mature leukocyte populations. Specifically, G-CSF 
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prolongs neutrophil survival in vitro^ augments neutrophil phagocytic activity, and 
enhances neutrophil respiratory burst. (Standiford TJ, et aU J. Invest. Med. (19097) 
45:335-345). In contrast, GM-CSF promotes the proliferation and maturation of 
both neutrophils and macrophages; the activating properties of this cytokine are 

5 primarily directed toward mature macrophage populations. (Chen GH, et aly J 
Immunol (1994) 152:724-734). 

Six closely related subfamilies of chemotactic cytokines, referred to as 
chemokines, have now been characterized. Of these, members of at least two 
subfamilies contribute to pulmonary antimicrobial host defense (Mandujano J, et al., 

10 Amer. J. Respir. Crit. Care Med. (1995) 151:1233-1238); Baggiolini M, et al, Ann. 
Rev. Immunol. (1997) 15:675-705). Members of the C-X-C chemokme family, 
which include IL-8, MEP (macrophage inflammatory protein)-2, KC, ENA-78, and 
NAP-2, have predominant neutrophil stimulatory and chemotactic activities, 
whereas the C-C family, which includes MCP (monocyte chemoattractant protein)-!, 

15 MCP-2, MCP-3, RANTES, MIP-1 alpha, and MIP-1 beta, exerts predominant 
chemotactic and/or activating effects on macrophages, lymphocytes, and eosinophils. 
(Huftiagle GB, et a/., The Role Of Chemokmes In Pneiimonia, in Koch A, Stricter 
R, eds. Chemokines in Disease. Texas: R.G. Landis Co; 1996:151-168) 

Interferon-gamma is a cytokine produced by T cells (both alpha beta- and 

20 gamma delta- T cells) and NK cells that is instrumental in cell-mediated immunity 
against a broad spectrum of pulmonary pathogens. This cytokine activates several 
macrophage effector cell functions, including stimulation of resputttory burst, the 
ability to present antigen, priming of macrophage-derived TNF release, and 
enhanced in vitro macrophage antimicrobial activity against bacterial, fimgal, and 

25 mycobacterial organisms. 

InterIeukin-12 is a cytokine that promotes Thl-type immune responses, while 
mhibiting Th2-type inmiune responses. Specifically, IL-12 stimulates the 
development of Thl T cells and NK cells, and increases the cytolytic activity of 
CD8+ cells and NK cells. Most important, IL-12 serves as the major inducer of IFN- 

30 gamma from T cells and NK cells. 



wo 02/20034 



PCT/USOl/21935 



-50- 

In contrast to IL-12, E.-10 promotes the development of Th2-type immune 
responses, while inhibiting cell-mediated (Thl-type) immuni'fy. (Howard M, et al, J. 
Clin. Immunol. (1992) 12:239-247). This cytokine was shown to exert potent anti- 
inflammatory properties, in part by directly de-activating neutrophils and 
5 macrophages, and by downregulating the expression of TNF, IFN-gamma, and 
members of bolh the C-X-C and C-C chemokine families. IL-10 is instrumental in 
attenuating undesired production of proinflammatory cytokines in states of systemic 
immune cell activation, 

EPO is a particularly preferred example of a protein whose amino acid 

10 sequence can be used to aid in the design of a synthetic bioactive protein having 
erythropoieis-stimulating activity. EPO is the principal factor responsible for the 
regulation of red blood cell production during steady-state conditions and for 
accelerating recovery of red blood cell mass following hemorrhage (Jelkmann, W. 
(1992) Physiol. Reviews 72:449; Krantz, S.B. (1991) Blood 77:419; Porter, D.L. and 

15 M.A. Goldberg (1993) Exp. Hematol. 21:399; Nissenson, A.R. (1994) Blood Purif. 
12:6). The circulating form of human EPO is a 165 amino acid (aa) glycoprotein 
with a molecular weight of approximately 30,000 (Sawyer, S.T. et al. (1994) 
Hematol. Oncol. Clinics NA 8:895; Jacobs, K.J. et al Nature (1985) 313:806; Lin, 
F-K. et al Proc. Natl. Acad. Sci. USA (1985) 82:7580). Although the cDNA for 

20 EPO predicts a molecule with 166 amino acid residues, the carboxy-terminal 
arginine is removed in a post-translational modification (12). There are three 
potential sites- for N-linked glycosylation and all are filled. One 0-linked 
carbohydrate moiety is also present (13). The effects of glycosylation are complex. 
Although unglycosylated E. coli-derived EPO shows fiill biological activity in vitro, 

25 glycosylation is apparently necessary for full activity m vivo. Thus, E. coli-produced 
and deglycosylated, naturally derived EPO show very low activity in animal studies 
(Krantz, S.B. (1991) Blood 77:419; Sasaki, H. et al (1987) J. Biol. Chem. 
262:12059; Lowy, P.H. et al (1960) Nature 185:102; Wojchowski, D.M. et al 
Biochem. Biophys. Acta 910:224; Dordal, M.S. et al (1985) Endocrinology 

30 116:2293), 
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Consistent with tiie above, variable glycosylation patterns show variable 
effects. For example, desialyated EPO exhibits both enhanced in vitro and decreased 
in vivo activity, an effect attributed to the exposure of galactose residues which are 
recognized, bound, and cleared by hepatocytes (Spivak, J.L. and B.B. Hogans (1989) 

5 Blood 73v:90; Goldwasser, E. et al (1974) J. Biol. Chem. 249:4202). The branching 
pattern of fully sialyated EPO also makes a difference in biological activity. 
Predominantly tetra-antennary branched EPO shows activity equivalent to 
"standard" EPO, while predominantly bi-antemnaiy branched EPO shows three-fold 
more activity in vitro but only 15% of normal activity in vivo (Takeuchi, M. et al 

10 (1989) Proc. Natl. Acad. Sci. USA 86:7819). Studies have indicated that only N- 
linked, and not O-linked sugars, are important in EPO functioning (Higuchi, M. et 
al (1992) J. Biol. Chem. 267:770319). 

, Colony-stimulating factors and RANTES also comprise particularly preferred 
examples of proteins whose amino acid sequence can be used to aid in the design of 

15 a synthetic bioactive protein within the ambit of the present invention, 

B. Polymer-Modified Proteins and Polypeptides 

A second attribute of the present invention concerns the ability to modify a 
protem or polypeptide to contain one or more structurally defined polymer adducts at 
preselected positions. Although polymer-modified proteins have been previously 

20 described, the nature and manner of the possible polymer-modifications of flie 
present invention markedly differ fi*om such prior efforts. 

Since the synthetic bioactive proteins of the present invention are chemically 
synthesized, in whole or part, it is possible to synthesize such proteuis in a manner 
that will permit one to covalently bond polymers (such as polymers or sugar, 

25 polyethylene glycol, glycols, hyaluronic acid, etc.) to one or more particular, user- 
selected and user-defined sites in a polypeptide a protein backbone. Moreover, the 
synthetic production of the proteins in the present invention permits one to ensure 
that such modifications are present at each of the user-selected and user-defined sites 
of every molecule in a preparation. Such uniformity and control of synthesis 
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markedly distinguishes the present invention from the random modifications 
permitted to the use of the methods of the prior art. Significantly, the present 
invention permits one to design a synthetic protein in which any non-critical residue 
may be derivatized to contain a polymer adduct. Moreover, for each such user- 

5 selected and user-defined site, the user may define the precise linkage (amide, 
thioester, thioether, oxime, etc.) through which such adducts will be bonded to the 
polypeptide or protein backbone. Additionally, the particular polymer adducts 
desired to be present at a particular site may be varied and controlled, such that a 
final preparation is obtained in which every protein or polypeptide present contains 

10 precisely the same derivatized adducts at precisely the same derivatized sites. Thus, 
the present invention permits the formation of homogeneous preparations of 
polymer-modified polypeptide and proteins. 

In addition to providing a means for varying the position and number of 
attachment sites to which polymer can be bound, the present invention permits one 

15 to vary the nature of the bound polymer. The polymer adducts that may be 
incorporated into any particular user-selected and user-defined site can be of any 
defined length. Likewise, consistent with the methods of present invention, it is 
possible to employ polymers of different lengths at different sites. Thus, in one 
embodiment, the synthetic bioactive proteins of the present invention may be either 

20 mono-modified, or poly-modified, with a polymer adduct Where more than one 
polymer adducts introduced mto a particular polypeptide or protein, the employ 
polymers may be "mono-speciated," "poly-speciated," '^uniformly speciated/' or to 
"diversely speciated in." As used herein, the term ''mono-speciated" is intended to 
refer to a polypeptide or protein that has been modified by a single species of 

25 polymer. In contrast, the term ''poly-speciated" is mtended to refer to a polypeptide 
or protein that has been modified by more than a single polymer species. Such poly- 
speciated polypeptide or protein are said to be **uniformly-speciated" if, at each 
modified site of tfie polypeptide or protem, the same, single species of polymer is 
present. In contrast, a poly-speciated polypeptide or protein is said to be "diversely- 

30 speciated" if, the modified sites of the polypeptide or protein are modified with 
different species of polymer. 
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Moreover, it is possible to vary the ©ctent of linearity or branchedness of the 
polymer adduct at each of the user-selected in user-defined sites. Thus the polymer 
adducts may be linear, branched, or uniformly branched. Hie term '\mifonnIy 
branched" is intended to mean that all branches of a polymer at a particular site have 

5 the same structure and lengfli. As we appreciate, the present invention permits one 
to independently vary both the length of any individual branch, as well as the 
structure of the polymer present at such branchpomt. 

In sum, tihie present invention permits one to define (1) the location and (2) 
fi-equency of polymer-modified, user-selected and user-defined sites in a polypeptide 

10 or protein backbone, as well as to control the (3) length, (4) species, and (5) degree 
of branching present at each such site. Additionally, with respect to polypeptide and 
proteins having multiple polymer modifications, the present invention permits one to 
independently define each of the above-identified five variables for each site. 
Moreover, with respect polypeptides and proteins having branched polymer 

15 modifications, the present mvenfion permits one to independently define each of the 
above-identified five variables for each branchpoint. Thus, the present invention 
provides substantial flexibility of polymer modification 

C. Protein and Peptide Delivery 

The optionally polymer-modified synthetic bioactive polypeptides and 
20 proteins of the present invention may be employed as pharmaceutical agents to effect 
the treatment of diseases and conditions. Most preferably, when administered to a 
patient or individual in need of therapy, such synthetic bioactive polypeptides and/or 
proteins will be administered using a drug delivery system. The uses such a system 
enables a drug to be presented to the patient in a manner that makes it acceptable for 
25 them and enhances the effectiveness of the desired bioactivity. The purposes of the 
present mvenfion, preferred drug delivery systems include systems capable of 
administering polypeptides or proteins buyout oral, nasal, or inhalation routes, or 
intramuscularly, subcutaneously, transdermally, intravenously, intraurally or 
intraocularly. 
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Such drug delivery systems may include formulations that provide site- 
specific release, or that enhance protection for the intestinal mucosa. Suitable 
formulations mclude: dry powder formulations, delivery via viral particle 
encapsulation, liposome encapsulation, transdermal patches, electrically aided 
S transport (electroporation therapy) and polymer/niazone conjugation. 

In a preferred embodiment, such drug delivery devices will respond to 
changes in the biological environment and deliver — or cease to deliver — drugs based 
on these changes. A range of materials have been employed to control the release of 
drugs and other active agents.: poly(urethanes), poly(siloxanes), poly(methyl 

10 methacrylate), poly(vinyl alcohol) for hydrophilicity and strength, poly(ethylene), 
poIy(vinyl pyrrolidone), poIy(2-hydroxy ethyl methacrylate), poly(n-vinyI 
pyrrolidone), poly(methyi methacrylate), poly(vinyl alcohol), poIy(acryIic acid), 
polyacrylamide, poly(ethylene-co-vinyl acetate), poly(ethylene glycol), 
poly(methacrylic acid), etc. In a further preferred embodiment, biodegradeable 

15 polymers will be employed to facilitate drug delivery. Such polymers incude 
polylactides (PLA), polyglycolides (PGA), poly(lactide-co-glycolides) (PLGA), 
polyanhydrides, and polyorthoesters. Drug delivery devices, and methods for their 
use are described in U.S. Patents ; 6,072,041; 6,041,253; 6,018,678; 6,017,318; 
6,002,961; 5,879,712; 5,849,331; 5,792,451; 5,783,212; 5,766,633; 5,759,566; 

20 5,690,954; 5,681,811; 5,654,000; 5,641,511; 5,438,040; 4,810,499; and 4,659,558. 

in. The Production of Synthetic Bioactive Proteins 

The production of the synthetic bioactive proteins of the present invention 
can be envisioned as having the following steps or components: design, synthesis, 
peptide ligation, protein folding, assessment of bioactivity, polymer modification of 
25 the polypeptide backbone (Figure 2). 

A. The Design of Preferred Synthetic Bioactive Proteins 

In a particularly preferred embodiment, the synthetic bioactive proteins of the 
present invention will be an erythropoiesis stimulating protein. As used herein, an 
erythropoiesis stimulating protein is a protein that mediates the production of red 
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blood cells. The erythropoiesis stimulating bioactivity of a protein may be 
determined by any of a variety of means, such as by following erythropoiesis after in 
vivo administration, by assaying the capacity of the protein to mediate the 
proliferation of EPO-dependent cell lines, etc. 
5 A preferred erythropoiesis stimulating protein is mammalian, most preferably 

human, EPO and its analogs. Erythropoietin is predominantly synthesized and 
secreted by tubular and juxtatubular capillary endothelial and interstitial cells of the 
kidney. Chronical kidney disease causes the destruction of EPO-producing cells in 
the kidney. The resulting lack of EPO frequently induces anemias. The main 

10 clinical use of EPO is therefore the treatment of patients with severe kidney 
insufficiency (hematocrit below 0.3) who usually also receive transfusions as well as 
the treatment of anemic patients following chemotherapy. Typically, EPO is 
synthesized recombinantly in hamster ovary cells for clinical use. EPO is a relative 
heat- and pH-stable acidic (pi =4,5) protein of 165 amino acid residues in its mature 

15 form. EPO conveys its activity through the EPO-receptor. Approximately 40 
percent of the molecular mass of EPO is due to its glycosylation. Glycosylation is an 
important factor determining the pharmacokinetic behavior of EPO in vivo. Non- 
glycosylated EPO has an extremely short biological half-life. It still binds to its 
receptor and may even have a higher specific activity in vitro. However, recent 

20 experiments have shown that PEGylation of EPO significantly enhances its lifetime, 
but to significantly decrease EPO activity in a cell-based assay system. 

The synthetic erythropoiesis stimulating protebis ("SEPs") of the present 
invention are preferably chemically modified synthetic analogs of human urinaiy 
EPO. In a preferred embodiment, they contain one or more polymer moieties (most 

25 preferably pPEG moieties) attached to one or more peptide residue(s) through a 
thioether, oxime, amide or other linkage. In a highly preferred embodiment, such 
linkages will be at one or more of the positions that are naturally glycosylated in 
human urinary EPO (i.e., positions 24, 38, 83 and 126). Most preferably, the SEP 
molecules will have pPEG moieties at two or more of such positions. In an 

30 alternative embodiment, other protein residues may be modified by polymer 
moieties. The positions of modification for the synthetic bioactive proteins of the 
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present invention include residues located in a disordered loop, region or domain of 
the protein, or at or near sites of potential protease cleavage. For example, polymer 
modifications may be introduced at one or more positions of 9, 69 and/or 125 of 
EPO, The total molecular weight of the SEP molecules of the present invention may 
5 vary from about 25 to about 150 kDa, and more preferably from about 30 to about 80 
kDa. The molecular weight can be controlled by hicreasing or decreasing the 
number and structure of the polymer (such as pPEG) utilized for modification of a 
given analog. The pPEG mediated hydrodynamic MW for the larger constructs is 
greater than about 100 kDa. In vitro assays in cells expressing the human EPO 

10 receptor mdicate that SEPs of the present invention have an ED50 that is equivalent 
to that of recombinantly produced glycosylated human EPO. 

Oxime-linked pPEG analogs are preferably constructed by attaching an 
aminooxy, ketone or aldehyde functionalized pPEG to the SEP protein at a non- 
naturally encoded amino acid bearing a side chain aminooxy, ketone or aldehyde 

15 functionality. For example, positions 89 and 117 of SEP-0 and 1 contain pseudo 
glutamates (a non-naturally encoded amino acid bearing a side chain of the formula - 
CHa-CH2-S-COOH (as compared to glutamate side cham -CHa-CH2-CH2- 
COOH)). SEP analogs utilizing thioether linkages are preferably constructed to 
contain a thiol fimctionality provided by a cysteine or unnatural amino acid with a 

20 side cham bearing the thiol. Figure 3 depicts the basic structure of one type of 
preferred synthetic erythropoiesis stimulating proteins. 

In an alternative embodiment, the SEP molecules of the present invention 
may comprise "circularly permuted" EPO analogs m which the natural ammo and 
carboxy terminus of EPO have been relocated. Most preferably, such relocation will 

25 move the amino and carboxy termini to positions of low structural constraint, such 
as to disordered loops, etc. The disordered loop around positions 125 and 126 
(relative to the native EPO residue numbering system) is an example of such a 
relocation site. Most preferably, such SEPs will be disulfide free, and will be 
chemically modified to contain polymer moieties at preselected residues. 
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Altematively, the SEP molecules may have amino and carboxy termini 
relocated to a naturally occurring glycosylation site, or to other sites amenable to 
glycosylation, such as position 126 and 125. The SEP molecules may also include 
modifications of the amino and carboxy termini to eliminate or modify charge (such 

5 as by carboxy amidation, etc.). 

In a preferred example of such circularly p^muted molecules, new N- and C- 
termini are provided by positions 126 and 125, respectively. The natural disulfide- 
forming cysteines at positions 7, 29, 32 and 161 are preferably replaced by the non- 
naturally encoded amino acid, L-a-N-butyric acid (Aba), which cannot form 

10 disulfide bridges. Residues R166, E37 and V82 are preferably replaced with 
alanines to improve production. Also, an additional cysteine is preferably inserted 
between positions 1 and 166 relative to the native EPO number scheme, which is 
numbered below as '0'. The resulting molecule contains four cysteines (at positions 
126, 0, 38, and 83 (as read in the N- to C-terminal direction)), which are utilized as 

15 (1) ligation sites and (2) thioether-forming pPEG attachment sites. Optionally, a 
cysteine may replace A125 to provide an additional pPEG attachment site. The total 
molecular weight of such SEP molecules of the present invention may vary fix>m 
about 25 to about 150 kDa, and more preferably fi-om about 30 to about 80 kDa. 
The molecular weight can be controlled by increasing or decreasing the number and 

20 structure of tiie polymer (such as pPEG) utilized for modification of a given analog. 
The pPEG mediated hydrodynamic MW for the larger constructs is greater than 100 
kDa. Optional pPEG attachment sites are located at positions 125, 9, and 24 (as read 
in the N- to C-terminal direction). Additional SEP analog designs have alternative 
N- and C-termini in the disordered loop region, and/or truncate residues Srom the 

25 new N- and/or C-termini. The basic structure of preferred circularly permuted 
molecules is shown in Figure 4. 

In an alternative embodiment, the synthetic bioactive proteins of the present 
invention are a manunalian, most preferably human C-GSF. G-CSF induces the 
rapid proliferation and release of neutrophilic granulocytes to the blood stream, and 

30 thereby provides therq>eutic effect in fighting infection. The human GCSF protein 
is 174 amino acids in length (Patent: EP 0459630 (Camble,R. et a/.), and variants of 
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its sequence have been isolated. The protein has four cysteine residues and an O- 
glycosylation site at threonine position 133. 

In one embodiment of such a synthetic GCSF molecules, the amino acid 
sequence of the molecule will be altered, relative to llie sequence of native GCSF, so 
5 as to contain natural, or more preferably, non-naturally encoded, hydrophobic 
residues at one or more of positions 1, 2 or 3. Optionally, such molecules will be 
further modified to contain an additional natural, or more preferably, non-naturally 
encoded, hydrophobic residues at position 5 of GCSF, and/or at positions 173 and/or 
174. 

10 In a further preferred embodiment the synthetic GCSF molecules will be 

polymer-modified (preferably pPEG) at one or more of positions 63, 64 and/or 133 
and/or at one or more of positions 1 and 174. The pPEG polymers may be linear or 
branched, charged, uncharged, or mixed; and may have a molecular weight range of 
from about 5 to about 80 kDa, and mofe preferably, from about 40 to about 70 kDa 
15 pPEG total MW contribution depending on the number and structure of pPEGs 
utilized for modification. The hydrodynamic radius exhibits a larger MW effect in 
vivo (for example, a 10 kDa branched pPEG exhibits an effective MW of about 55 
kDa). The estimated pPEG mediated hydrodynamic MW is greater than 100 kDa. 

For analogs utilizing oxime Imkage, the pPEG is preferably attached to non- 
20 naturally encoded residues bearing a side chain aminooxy, ketone or aldehyde 
functionality. For analogs utilizing thioether linkage, the pPEG is preferably 
attached to a cysteine or unnatural amino acid with a side chain bearing a thiol. For 
analogs utilizing amide linkage, the pPEG is attached to a natural or unnatural amino 
acid bearing a reactive side chain amine. 
25 The basic structure of the synthetic bioactive GCSF proteins is shown in 

Figure 5. In the figure, "J" designates a non-naturally encoded residue having a 
hydrophobic side chain. 

The bioactivity of such synthetic bioactive GCSF proteins may be assayed by 
conventional means, such as by assaying for their cq)acity to mediate the 
30 proliferation of a fector-dependent human or mouse cell line (e.g., NFS60 cell Ime), 
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or by measuring neutrophil stimulation, half-life, and immunogenicity in vivo, either 
with or without a delivery system targeting >20 day half-life/release). 

In an alternative embodunent, the synthetic bioactive proteins of the present 
invention are a mammalian, most preferably human chemokme, RANTES. 
5 RANTES blocks entiy of M-tropic HIV strains through the primary receptor CCR5, 
and also down-modulates inflammation pathways involved in asthma, transplant 
rejection and wound healing (Kaluikovich A, et al, Immunol Lett. (1999) 68:281- 
287). The synthetic RANTES molecules of the present invention preferably differ 
fix)m the RANTES chemokine in being chemically modified such that: (1) the N- 

10 terminal serine found at position 1 of RANTES 1-68 is substituted with an n- 
nonanoyl C'NNY") group; and (2) the tyrosine at position 3 is substituted with a non- 
naturally encoded amino acid having a hydrophobic side chain. Such compounds 
have been found to be extremely potent, possessing ED50 in the pM range, 
compared to the mM range for recombinant "Met-RANTES 1-68". 

15 Potent RANTES analogs have been created having one or more additional 

changes to those noted above, such as replacement of the proline at position 2 with a 
non-naturally encoded amino acid having a hydrophobic side chain, and by the 
attachment of a fatty acid to the C-terminus of the molecule. Receptor specificity 
has been improved in combination with potency by modifications to the N-loop 

20 region corresponding to RANTES residues 12-20. In a further preferred 
embodiment, the synthetic RANTES molecules of the invention incorporate a 
polymer moiety (such as pPEG, or nonanoy ("NNY") or Y3X at their N- or C- 
terminus to unprove inter alia in vivo half-life. The pPEG moiety is preferably 
attached through an oxime Imkage to a non-naturally encoded amino acid that is 

25 introduced at position 66, 67, -68 or linker at position 68, with preference to position 
67. The fatty acid is preferably attached through an oxfane linkage (preferably 
through a linker) to residue 68, such as an amino-oxy modified amino acid di- or tri- 
peptide linker. Otiier attachment chemistries may alternatively be employed. The 
molecular weight of flie larger constructs of such synthetic RANTES analogs ranges 

30 fix)m about 25 kDa to about 45 kDa dependmg upon the nature and structure of the 
pPEG attachment, and for the larger constructs have a pPEG mediated 
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hydrodynamic MW of greater than 60 kDa. The structure of preferred synthetic 
RANTES analogs is shown in Figure 6. 

B. Defined and Specific Polymer Modification of the Synthetic 
Bioactive Proteins and Peptides 

5 A further aspect of the present invention relates to chemical moieties and 

polymers, in particular water-soluble polymers, and their use to modify proteins so 
as to yield protein drugs with improved properties relative to the unmodified protein. 
The anticipated beneficial effects of such modification include but are not limited to 
(a) improved potency (b) longer lifetime of the protein drug in the circulation due to 
10 increased proteolytic stability and reduced clearance rates (c) reduced 
immunogenicity and (d) possibly differential targeting compared to the unmodified 
molecule. 

As discussed above, ahihough polyediylene glycol has been employed to 
modify proteins (see, for example, U.S. Patent 6,027,720, Kuga et ah^ wiiich relates 

15 to the modification of lysine residues of G-CSF to permit the attachment of 
polyethylene glycol molecules via reactive amme groups), such modifications are 
associated with molecular heterogeneity, and molecular diversity. 

As used herein, the term "molecular heterogeneity" is intended to refer to a 
variation in the number of polymer molecules attached to each protein of a protein 

20 preparation. The term '^molecular diversity" is intended to refer to (a) a variation in 
the site(s) of the protem that are modified by the polymer, (b) a variation in the 
length of the polymer adducts of different sites of the same protein, or of the same 
site(s) in different proteins of a protein preparation, or (c) a variation in the extent 
and/or nature of any branching of the polymer adducts of different sites of the same 

25 protein, or of the same site(s) in different proteins of a protein preparation. As used 
herein, the term "molecularly homogeneous" is intended to refer to a preparation of a 
protein in which all of the protein molecules contain the amino acid sequence and 
the same polymer modifications at the same positions. A mixture of two or more 
"molecularly homogeneous" protein preparations is referred to herein as a 

30 "molecularly defined" preparation. 
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In accordance with this aspect of the invention, a solution to the above- 
identified problems of polymer heterogeneity, diversity, and unsuitability involves 
the production of a mm class of biocompatible polymers which combine the 
advantages of both polypeptides (precise length, convenient synthesis) and "pPEG" 
5 C*pr^ision PEG"), a flexible, amphiphilic, non-immunogenic, polymer not 
susceptible to proteases) Rose, K. et d, (U.S. Patent Application Serial No. 
09/379,297, herein incorporated by reference). This new class of biocompatible 
polymer has the formula: 

"[CO-X-CO-NH-Y-NHln- 

10 n is an integer, preferably from 1-100 and more preferably from 2-100, where X and 
Y are biocompatible repeat elements of precise structure linked by an amide bond. 
Preferably, X and Y will be divalent organic radicals lacking reactive functional 
groups or are absent and are the same or dilBferent, and can vary independently with 
each repeating unit (n). Preferably, when n=2, at least one of X or Y will be selected 

15 from the group consisting of a substituted, unsubstituted, branched and linear 
aliphatic and aromatic group. More preferably, at least one of X or Y the divalent 
organic radicals will be selected from the group consisting of phenyl, a Cj-Ciq 
alkylene moiety, a Ci-C|o alkyl group, a heteroatom-containing phenyl, a 
heteroatom-containmg Cj-Cio alkylene moiety, a heteroatom-containing C^-Cio 

20 alkyl group, and a combmation thereof. 

Particularly preferred pPEG moieties have the formulae: 

-{CO-(CH2)2"CO- NH-(CH2)3-(OCH2CH2)3-CH2-NH}n- 

where n preferably varies from 1-100 and more preferably from 2-100; or 

.{CX)-(CH2)2-C(>NH-(CH2)6->ffl-<:0-(CH2)2-CO-NH-(CH2)3^ 

25 where n preferably varies from 1-50 and more preferably from 2-50. 

Such pPEG moieties can be syntiiesized in any of a variety of ways. Such 
moieties are, however, preferably produced usmg a solid phase stepwise chain 
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assembly of units, rather than a polymerization process. The use of such an 
assembly process permits the moieties of a preparation to have a defined and 
homogeneous structure, as to their length, the nature of their X and Y substituents, 
the position(s) (if any) of branch points, and the length, X and Y substituents, and 
position(s) of any branches. 

Preferably, such moieties will be synthesized by steps such as: 

(a) acylating the amino or hydroxyl group of a compound of the formula 
Z-Q-support with a molar excess of a derivative of a diacid having 
the formula, HOOC-X-COOH, where Z is HjN- or HO-; Q is a linker 

or a target molecule; and the support is a solid phase, matrix or 
surface; 

(b) activating the free carboxyl group of the product of step(a); 

(c) aminoiysing the product of step (b) with a molar excess of a diamine 
having the formula, NH2-Y-NH2; and 

(d) optionally repeating steps (a) - (c) using HOOC-X-COOH and NH2- 
Y-NH2, where said X and Y are divalent organic radicals or are 
absent and are the same or different, and can vary independently with 
each of said optionally repeated units, and are the same or different 
from the X and Y substituents used in any of the previous acylating 
and aminoiysing steps. 

In preferred embodiments, 6-mers, 12-mers, 18-mers and 32-mers of above 
repeat unit are employed. Where desired, the repeat unit can be used, for example, 
in conjunction with the amino group of lysine to form branched pPEG structures. 
The pPEG may be attached to flie synthetic proteins of the present invention by a 
variety of chemistries, including thioether, oxime and amide linkage formation. 

In one embodiment, the solid phase stepwise chain assembly of units 
comprises: 
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Step 1: Couple protected or unprotected diacid to amino on resin to generate 
amide bond at linkage site (where PG is a protecting group that is present or absent 
depending on diacid employed): 

PG-OOC-X-COOH + NH2-Y-NH-Resin 
5 Step 2: Remove protecting group {PG) on resin, if present 

HOOC-X-CO-NH-Y-NH-Resin 
Step 3: Couple protected or unprotected diamine to carboxy on resin to 
generate amide bond at linkage site (where PG is present or absent depending on 
diamine employed) 
10 PG-NH-Y-NH + HOOC-X-CO-NH-Y-NH-Resin 

Step 4: Remove protecting group (PG) on resin, if present 

-NH-Y"NH-OC-X-CO-NH-Y-NH-Resin 
Step 5: Repeat steps 1-4 'n' times to add *n' units then cleave from resin 
-[C0-X-C0-NH-Y.NH]-[CO-X<:O.NH-Y-NH]-[C(>X<:0-^^ 

15 

As discussed, linear and branched pPEG constructs are preferred water- 
soluble polymers for attachment to the synthetic bioactive molecules of the 
invention. The pPEGs employed bear pendant groups that are charged or neutral 
under physiological conditions, and can be made to vary in attachment chemistry, 

20 length, branching and solubility dependmg on the pPEG structure one employs. As . 
noted above, preferred pPEGs of the invention comprise a water-soluble polyamide 
having the repeat unit -CO-X-CO-NH-Y-NH-, where one or both of X and Y 
comprises a water-soluble repeat unit, and most preferably a *PEG'-based repeat 
unit. Although oligoureas are in principle accessible using a simple two-step solid 

25 phase procedure, as shown below for the case of an amino resm, MH2-Resm, and a 
synunetrical diamine, NH2-Y-NH2: 

Activation with carbonyldiimidazole -> im-CO-NH-Resin 
Aminolysis with diamine NH2-Y-NH2 -> NH2-Y-NH-CO-NH-Resin 
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where these two steps may be repeated a number of times to give an 
oligourea with repeat unit -NH-Y-NH-CO-, this approach is less prefeired. This is 
because the yields of the above steps may be non-quantitative at room temperature, 
even with very large excesses of reagents and long reaction times. Accordingly, it is 
preferable to use a three-step solid phase procedure, shown below for the case of an 
amino resin, NH2-Resm, to form a polyamide. The Reagents HO2C-X-CO2H and 
NH2-Y-NH2 should be symmetrical in order to avoid isomeric products. 

Acylation with diacid HO2C-X-CO2H -> HO-CO-X-CO-NH-Resin 
Activation with carbonyldiimidazole -> im-CO-OCO-X-CO-NH-Resin 
Aminoiysis with diamme NH2-Y-ISIH2 ^ NH2-Y-NH-C0-X-C0-NH-Resin 

These three steps may be repeated in sequence a number of times to give a 
polyamide with repeat unit -NH-Y-NH-CO-X-C0-. The polymer contains a precise 
number of monomer units, X and Y can be varied independently at each step, and 
end-groups can be chosen at will. For example, by using succinic anhydride 
("Succ") for the acylation step and 4,7,10-trioxa-l,13-tridecanediamine (also 
referred to as 'TEDA" or 'TTD") for the aminoiysis step, TEG'-based polyamides 
are formed wherein X is -CH2CH2-, Y is -NH-CH2CH2CH2-(OCH2CH2)3-CH2- 
NH- and the repeat unit is -NH"CH2CH2CH2-(OCH2CH2)3-CH2-NH- 
COCH2CH2CO-. In spite of the fact that the procedure involves divalent reagents 
with no protecting groups, cross-luiking is not a problem when standard commercial 
peptide synthesis resins are used (Rose et al., (U.S. Patent Application Serial No. 
09/379,297); and Rose et al., J. Am. Chem. Soc. (1999) 121:7034), except as noted 
below for the case of branched Lys cores for making branched constructs. 

For example, branched pPEG constructs can be made in a similar manner as 
for the "chemobody" constructs described in Rose et al., (U.S. Patent Application 
Serial No. 09/379,297) and Rose, K. and Vizzavona, J. {1 Am. Chem, Soc. (1999) 
121:7034). Thus, branched pPEG constructs can be made to have a branching core 
such as a lysine branching core, which is coupled through oxime linkers to a 
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preferxed water-soluble polyamide such as -(COCH2CH2CO-NH-CH2CH2CH2- 
(OCH2CH2)3-CH2-NH)n-. For instance, oxime bonds can readily formed between 
an aminooxyacetyl group on a Lys side chain at the other extremity of the 
polyamide, and glyoxylyl groups on a lysine core, for example the tetrameric core 
5 (0=CHCO)4Lys2Lys-. Thus an oxime linker off of each lysine branch point can be 
prepared as the structure -Lys(C0CH20N=CHC0-)amide-. An alternative 
construction can place the oxime bond between the polyamide and the free pendant 
group of the polyamide, preferably using a monoprotected diamine and deprotection 
after coupling the polyamide, to generate a tetravalent branched constructed depicted 
10 below: 

[(><:HC0-(NH<:H2CH2CH2-[0CH2CH2]3-CH2-NH-C0CH2CH2C^^^ 

Oxime chemistry can be used m making not only dimeric and tetrameric 
branched constructs, but it also suitable for assembling octameric branched 
constructs (Rose, K., J. Am. Chem. Soc. (1994) 116:30; Rose, K. et al, Bioconj. 

15 Chem. (1996) 7:552). It is, of course, possible to use other chemistries for such 
purposes when* using such pPEG polyamides (See, e.g.. Figure 7). Moreover, 
polyamide formation may be incorporated into a synthetic scheme for peptide 
synthesis involving Boc or Fmoc chemistry, but when elaborating such schemes it 
must be borne in mind that the aminolysis step will remove the Fmoc group, and will 

20 remove formyl protection of indole if Boc chemistry is to be used. 

Accordingly, such pPEGs can be made to have various branching cores (e.g., 
Lysme branching core etc.) linked through a bond of choice (e.g., amide, thioether, 
oxime etc.) to a linear water-soluble polyamide (e.g., -(Succ-TTD)n-, etc.), where 
the free end of each linear water-soluble polyamide can each be capped with a 

25 desired pendant group (e.g., carboxylate, amino, amide etc.) to provide a desired 
charge. Moreover the linear and branched pPEGs of the present invention can be 
made to comprise a unique functional group for attachment to a synthetic protein 
bearing one or more unique and mutually reactive fimctional groups, and the pendant 
groups of the pPEGs will preferably be non-antigenic (See, e.g.. Figure 7). 
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AU publications and patent applications mentioned in this specification are 
herein incorporated by reference to the same extent as if each mdividual publication 
or patent application was specifically and individually indicated to be incorporated 
by reference. The discussion of the background to the mvention herein is included 
5 to explain the context of the invention. This is not to be taken as an admission that 
any of the material referred to was published, known, or part of the prior art or 
common general knowledge anywhere in Ihe world as of the priority date of any of 
the claims. Having now generally described the invention, the same will be more 
readily understood through reference to the following examples, which are provided 
10 by way of illustration, and are not intended to be limiting of the present invention, 
unless specified. 

Example 1 

Synthesis of Synthetic Erythropoiesis Stimulating Protein SEP-0 

A synthetic erythropoiesis stimulating protein (SEP) was synthesized. The 
15 sequence of the full-length synthesized protein (designated "SEP-0 (1-166)" is: 

APPRLICDSR VLERYLLEAK EAEKITTGCA EHCSLNEKIT 

VPDTKVNFYA WKRMEVGQQA VEVWQGLALL SEAVLRGQAL 

LVKSSQPWv)/P LQLHVDKAVS GLRSLTTLLR ALGAQKv|/AIS 

PPDAASAAPL RTITADTFRK LFRVYSNFLR GKLKLYTGEA 
20 CRTGDR(SEQ ID N0:1) 

where ^ denotes a non-native amino acid residue consisting of a cysteine 
that is carboxymethylated at the sulfhydryl group. The SEP-0 protein was 
synthesized in solution from four polypeptide segments: 

Segment SEP-0: 1 (GRFN 1711; composed of residues 1-32 of SEQ ID NO:l): 
25 APPRLICDSR VLERYLLEAK EAEKITTGCA EH-thioester 



Segment SEP-0:2 (GRFN 1712, composed of residues 33-88 of SEQ ID N0:1): 

CSLNEKITVP DTKVNFYAWK RMEVGQQAVE VWQGLALLSE 
AVLRGQALLV KSSQPW-thioester (where Cys^^ is Acm protected) 
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Segment SEP-0:3 (GRFN 1713, composed of residues 89-1 16 of SEQ ID NO:l): 
CPLQLHVDKA VSGLRSLTTL LRALGAQK-thioester (where Cys*^ is 
Acm protected) 

Segment SEP-0:4 (GRFN 1714, composed of residues 117-166 of SEQ ID NO:l): 
5 CAISPPDAAS AAPLRTITAD TFRKLFRVYS NFLRGKLKLY 

TGEACRTGDR-carboxylate (where the C-terminal cysteine (Cys^^^) 
carries a picolyl (pico) protecting group) 

The peptides SEP-0:1 and SEP-0:2 and SEP-0:3 were synthesized on a 
thioester-generating resin by the in situ neutralization protocol for Hoc (tert- 

10 butoxycarbonyl) chemistry and stepwise solid phase peptide synthesis (SPPS) usmg 
established SPPS, side-chain protection and thioester-resin strategies (Hackeng, et 
• al, PNAS (1999) 96: 10068-10073; and Schnolzer, et al., Int. J. Pept. Prot. Res., 
(1992) 40: 180-193)) on an ABI433A automated peptide synthesizer or by manual 
chain assembly, or ordered and acqmred from commercial vendors. For mstance, a 

15 standard set of Hoc SPPS protecting groups was used, namely: Arg(Tos); 
Asp(cHex); Cys(4MeBzl) & Cys(Acm); Glu(cHex); ffisPNP); Lys(ClZ); Ser(Bzl); 
Thr(Bzl); Trp(fcrmyl); Tyr(BrZ); Met, Asn, Gin were side-chain .unprotected. 
Segment SEP-0:4 was synthesized analogously on a -0CH2-Pam-resin. The 
peptides were deprotected and shnulteneously cleaved from the resin support using 

20 HF/p-cresol according to standard Boc chemistry procedure; however, for those 
peptides containing protecting groups not removed in HF/p-cresol, the protecting 
groups were retained. The peptides were purified by preparative C4 reverse-phase- 
high pressure liquid chromatography (HPLC). Fractions containing pure peptide 
were identified using ES-MS (electrospray ionization mass spectrometry), pooled 

25 and lyophUized for subsequent ligation. 

Step 1: Ligation #7 Segment SEP-0:4 and segment SEP-0:3 were dissolved 
m TFE at 15 mM concentration. Saturated phosphate buffer (pH 7.9) containmg 6 
M guanidinium chloride and 1% thiophenol was added, resulting in a clear solution 
of the peptide segments. After ligation, the ligation mix was added to a solution of 2 
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ml TFE (trifluoroethanol), 6 ml 6M guanidinium chloride, 100 raM phosphate 
containing 25% p-mercaptoethanol and incubated for 20 minutes. The solution was 
acidified with a solution of 15 mg/ml TCEP (tris(2-carboxyethyl)phosphmeJICl) in 
glacial acetic acid and loaded onto a preparative C4 reverse-phase HPLC column (1 
inch diameter). The peptides were tiien purified by preparative gradient reverse- 
phase HPLC. Fractions containing the desu^d ligated product SEP-OiAcm+SEP- 
0:3+SEP-0:4 were identified by ES-MS and pooled. 

Step 2: Acm-removal #7 For Aran removal, flie aqueous acetonitrile 
solution containing the pooled fiactions of SEP-0:Acm+SEP-0:3+SEP-0:4 was 
diluted Ix with HPLC grade water, and solid urea was added for a final 
concentration of 2 molar. A threefold molar excess (relative to the total expected 
cysteine concentration) of a 30 mg/ml Hg(acetate)2 solution in 3% aqueous acetic 
acid was added and tiie solution is stirred for one hour. The solution was tiien made 
20% in p-mercaptoethanol, loaded onto a semi-prq)arative reveree-phase HPLC 
15 column and purified with a step gradient Fractions containing the desired product 
SEP-G:3+SEP-0:4 were identified by ES-MS and lyophilized overnight. 

Step 3: Ligation #2 Equal amounts of SEP-0:3+SEP-0:4 and SEP-0:2 were 
jomtiy dissolved in neat TFE at 15 mM concentration. 250 mM Phosphate buffer 
(pH 7.5) containmg 6 M guanidinium chloride and 1% thiophenol was added, 

20 resulting in a clear solution of the peptide segments. After one day of li^tion, the 
ligation mfac was added to a solution of 10 ml TFE, 10 ml p-mercaptoethanol, 10 ml 
piperidine and 20 ml 6M guanidinium chloride, pH 4, and incubated for 20 minutes 
to remove any remaining protecting groups. The solution was acidified witii a 
solution of 15 mg/ml TCEP in 20% aqueous acetic acid, loaded onto a preparative 

25 reverse-phase HPLC column and purified with a linear gradient. Fractions 
containmg tiie desired ligated product SEP-^:Acm+SEP-0:2+SEP-0:3+SEP-0:4 were 
identified by ES-MS and lyophilized overnight. 

Step 4: Carboxymethylation SEP-0:Acm+SEP-0:2+SEP-0:3+SEP-0:4 was 
dissolved in TFE at 15 mM concentration. A two-fold excess (v/v) of 200 mM 
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Phosphate buffer (pH 7.9) containing 6 M guanidinium chloride was added, resulting 
in a clear solution of the peptide segment. A 25-fold excess of bromo-acetic acid 
dissolved m a minimum amount of methanol was added, and the solution was 
allowed to react for two hours. The solution was acidified with a solution of 15 
5 mg/ml TCEP in 20% aqueous acetic acid, loaded onto a preparative reverse-phase 
HPLC column and purified with a step gradient. Fractions containing the desired 
carboymethylated product SEP-0:Acm+SEP-0:2+SEP-0:3+sm>-0:4+Et w»e 
identified by ES-MS and pooled. 

Step 5: Picolyl Removal Zinc dust was activated in 2M HCl for 30 minutes. 

10 Peptide SEP-0:Acm+SEP-0:2+SEP-0:3+SEP-0:4+Et was dissolved in neat TFE at 
about 10 mg/ml concentration. The solution lyas diluted witii 4x (v/v relative to 
TFE) 6M guanidinium chloride, 100 mM acetate, pH 4, containing (freshly added) 
35 mg/ml L-metiiionine and 35 mg/ml dodecylsarcosine (i.e. sodium N- 
dodecanoylsarcosine). The solution was added to tiie activated Zn powder. The 

15 reaction was monitored at ~lhr intervals and is complete after five hours. After 
completion, the supernatant was removed and the remaining Zn powder washed 
twice for five minutes with 6M guanidinium chloride, pH 4, 100 mM acetate 
containmg 35 mg/ml L-metiuonine and 35 mg/ml dodecylsarcosine containing 20% 
TFE as well as once with flie same solution containing 20% P-mercaptoethanol. The 

20 combined product was acidified witii a solution of 15 mg/ml TCEP in 20% aqueous 
acetic acid, loaded onto a preparative reverse-phase HPLC column and purified with 
a step gradient. Fractions containing the desired modified product SEP- 
0:Acm+SEP-0:2+SEP-0:3+SEP-0:4+Et-Pico were identified by ES-MS and pooled. 

Step 6: Acm-removal #2 The pooled solution of SEP-0:Acm+SEP- 
25 0:2+SEP-0:3+SEP-0:4+Et-Pico was diluted 3x with HPLC grade water, and solid 
urea was added for a final concentration of 2 molar.- A tiueefold molar excess 
(relative to tiie total expected q'steine concentration) of a 30 mg/ml Hg(acetate)2 
solution in 3% aqueous acetic acid was added and tfie solution stirred for one hour. 
The solution was flien made 20% in p-mercaptoethanol, loaded onto a semi- 
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preparative reverse-phase HPLC column and purified with a step gradient. Fractions 
containing the desired product SEP-0:2+SEP-0:3+SEP-0:4+Et-Pico were identified 
by ES-MS, diluted 2x (v/v) with water containing 2x (w/w relative to peptide mass) 
DPC (dodecylphosphocholine) and lyophilized overnight 

5 Step 7: Ligation #3 Equal amounts of SEP-0:2+SEP-0:3+SEP-0:4+Et-Pico 

• and SEP-0:1 were jointly dissolved in neat TFE at 15 mM concentration and 250 
mM Phosphate buffer (pH 7.5) containmg 6 M guanidinium chloride is added. To 
the solution 1% thiophenol was added. After one day of ligation, the ligation mix 
was added to a solution of 10 ml TFE, 10 ml P-mercaptoethanol, 10 ml piperidine 

10 and 20 ml 6M guanidinium chloride, pH 4, and incubated for 20 mmutes to remove 
any remaining protecting groups. The solution was acidified with a solution of 15 
mg/ml TCEP m 20% aqueous acetic acid, loaded onto a preparative reverse-phase 
HPLC column and purified with a linear gradient. Fractions containing the desired 
ligated product SEP-0 (1-166): (SEQ ID N0:1) were identified by electrospray mass 

15 spectrometry, diluted 2x (v/v) with water containing 2x (w/w relative to peptide 
mass) dodecylsarcosine and lyophilized. 

Step 8 Folding: Full-length ligated peptide SEP-0 (1-166) was dissolved in 
200 mM Tris buffer (pH 8.7) containing 6 M guanidinium chloride and 20% TFE 
and a ten-fold molar excess (relative to Cys residues in protein) of cysteine. This 

20 solution was dialyzed ovemight against a solution of 200 mM Tris buffer (pH 8.7) 
containmg 3 M guanidinium chloride at room temperature. The solution was then 
dialyzed against a solution of 200 mM Tris buflfer (pR 8.7) containing 1 M 
guanidinium chloride at room temperature for 4 hours at 4 "C and finally against 10 
mM phosphate buffer (pH 7.0) for 4 hours at to yield the final folded product. 

25 Folding was verified by electrospray ES-MS and CD (cuxjular dichroism) 
spectrometry. 

Step 9 Purification: The folded polypeptide was concentrated 5x in 
centricon concentrator vials and loaded on to Resource S cation exchange column 
equilibrated at 10 mM phosphate, pH 7.0. The folded protein was eluted in a linear 
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salt gradient to 500 niM NaCl in 10 minutes. Fractions containing the desired folded 
product SEP-0 (1-166) were identified by SDS-polyacrylamide gel electrophoresis 
(SDS-PAGE), and frozen and stored at - 80'C. An analytical reverse-phase HPLC 
chromatogram and an ES-MS spectrum of the folded protein product as well as a CD 
5 spectrum demonstrated the presence of folded protein. 

Example 2 

Synthesis of Synthetic Erythropoiesis Stimulating Protein SEP-1-L30 

A second synthetic erythropoiesis stimulating protein (designated SEP-1- 
L30) was synthesized to contain oxime-forming groups at positions 24 and 126 of 
10 SEP-0. These groups were then used to form SEP-1-L30, in which linear (EDA- 
Succ-)i8 carboxylate (EDA = (4,7,10)-trioxatridecane-U13diamine, also called 
TTD; Succ = -CO-CH2CH2CO-) polymers have been joined to the protein backbone. 
The sequence of the full-length SEP-1 (1-166) is: 

APPRLICDSR VLERYLLEAK EAEK^ITTGCA EHCSLNEKIT 

15 VPDTKVNFYA WKRMEVGQQA VEVWQGLALL SEAVLRGQAL 

LVKSSQPWvj/P LQLHVDBCAVS GLRSLTTLLR ALGAQKlj/AIS 

PPDAAK*«AAPL RTITADTFRK LFRVYSNFLR GKLKLYTGEA 
CRTGDR (SEQ ID NO: 2) 

where Y denotes an non-native amino acid residue consisting of a cysteme 
20 tiiat is carboxymethylated at the sulfhydryl group, and where K!^ denotes a non- 
native lysine that is chemically modified at the e-amino group with an oxime linker 
group coupled to a designated water-soluble polymer through an oxime bond. 

A. Oximation of GRFN1776 and GRJN1711 with GRJWP32 

Oxhne formation was performed to attach water-soluble polymers bearing an 
25 aminooxyacetyl group to peptides carrying a ketone carbonyl group. To accomplish 
this, the following peptide segments were synthesized: 
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Segment SEP-l:4 (GRFN 1776; composed of residues 1 17-166 of SEQ ID NO:2): 
CAISPPDAAK AAPLRTITAD TFRKLFRVYS NFLRGKLKLY 
TGEACRTGDR-carboxylate (where Lys'^^ is modified with a levulinic 
acid residue at the e-amino group, and where Cys"^ is Acm protected) 

5 Segment SEP-1:1 (GRFN 1711, composed of residues 1-32 of SEQ ID N0:2): 

APPRLICDSR VLERYLLEAK EAEKITTGCA EH-thioester 
(where Lys24 is modified with a levulinic acid residue) 

Segment SEP- 1:1 (GRPN 1711) was synthesized on a thioester-generating 
resin, and Segment SEP-1:4 (GRFN 1776) on a -0CH2-Pam-resin as in Example 1. 

10 Lysines 24 and 126 of these two peptide segments were initially protected with an 
Fmoc group at the e-amino group. After completion of the chain assembly, the 
Fmoc-bearing amino groups were deprotected following standard Fmoc deprotection 
procedures and modified by attachment of levulinic acid to each peptide resin, 
respectively. The peptides were then deprotected and simultaneously cleaved fi-om 

15 the resin support as described in Example 1. The peptides were purified by 
preparative C4 reverse phase HPLC. Fractions containing pure peptide were 
identified usmg ES-MS, pooled and lyophilized for subsequent ligation. GRFNP32 
[-(EDA-Succ)i8 carboxylate] was assembled on a Sasrin carboxyl-generating resin 
following standard protocols (Rose, K. et al, U.S. Patent Application Serial No. 

20 09/379,297; Rose, et al., J Am Chem &c.(1999) 121: 7034). An aminooxyacelyl 
(AoA) moiety was attached to the N-terminal amino group of the polymer by 
coupling a fivefold excess of activated Boc-aminooxyacetic acid. The polymer 
chain was cleaved from the resin support using classic Fmoc-chemistry procedures. 
The polymer chain was purified by preparative C4 reverse-phase HPLC. Fractions 

25 containing pure polymer were identified using ES-MS, pooled and lyophilized for 
subsequent ligation. 

Segment SEP-1:4 and GRFNP32 were jointly dissolved at an equimolar ratio 
in 50% aqueous acetonitrile containing 0.1% TFA . The solution was then 
lyophilized. The dried powder was dissolved and the polymer-modified peptide 
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separated from unmodified peptide and unreacted polymer by preparative gradient 
C4 reverse-phase HPLC. Fractions containing flie desired oximated product SEP- 
1:4 + GP32 were identified by ES-MS and pooled and lyophilized. 

Segment SEP-1:1 and GRFNP32 were jointly dissolved at an equimolar ratio 
in 50% aqueous acetonitrile containing 0.1% TFA. * The solution was then 
lyophilized. The dried powder was dissolved and the polymer-modified peptide 
separated from unreacted polymer by preparative gradient C4 reverse-phase HPLC. 
Fractions containing the desired oximated product SEP-1:1 + GP32 were identified 
by ES-MS and pooled and lyophilized. 

B. Synthesis of Synthetic Erythropoiesis Stimulating Protein SEP-1- 
L30 

SEP-1-L30 was synthesized m solution from four polypeptide segments: 

Segment SEP-1:1+GP32 (GRFN 1711 + GRFNP32, corresponding to residues 1-32 
ofSEQIDNO:2): 

APPRLICDSR VLERYLLEAK EAEltf^'^ITTGCA ' EH-thioester (where Lys^* 
is modified at the e-amino group with a levulinic oxime linker group that is 
coupled to GRFNP32 through a levulinic-ammooxyacetyl (Lev-AoA) oxfane 
bond denoted K**'') 

Segment SEP-1:2 (GRFN 1712, corresponding to residues 33-88 of SEQ ED NO:2): 
20 CSLNEKIT VPDTKVNFYA WKRMEVGQQA VEVWQGLALL 

SEAVLRGQAL LVKSSQPW-thioester (where Cys^^ is Acm protected) 

Segment SEP-1:3 (GRFN 1713, corresponding to residues 89-116 of SEQ ID 
NO:2): 

CP LQLHVDKAVS GLRSLTTLLR ALGAQK-thioester (where Cys*^ is 
25 Acm protected) 



wo 02/20034 



PCT/USOl/21935 



-74- 

Segment SEP:1:4+GP32 (GRFN 1776 + GRFNP32, corresponding to residues 117- 

166 ofSEQroNO:2): 

CAIS PPDAAK°*AAPL RTITADTFRK LFRVYSNFLR GKLKLYTGEA 
CRTGDR-carboxylate (where Lys^^^ is modified at the e-amino group with a 
5 levulinic oxhne linker group that is coupled to GRFNP32 through a 

levulinic-aminooxycetyl (Lev-AoA) oxime bond denoted K"'^ and where the 
C-terminal cysteine carries a picolyl (pico) protecting group) 

Synthesis of additional peptides, ligation reactions, carboxymethyiation, 
protecting group removal reactions, folding and purification were performed as 
10 described in Example 1 to yield full-length, folded SEP-1-L30. An analytical 
reverse-phase HPLC chromatogram and an ES-MS spectrum of the folded protem 
product as well as a CD spectrum demonstrated the presence of folded protem 

Example 3 

Synthesis of Synthetic Erythropoiesis Stimulating Protein SEP-1-L26 

15 A third synthetic erythropoiesis stimulatmg protein (designated SEP-1 -L26) 

was synthesized to contain oxime-forming groups at positions 24 and 126 of SEP-0. 
These groups were then used to form SEP-1 -L26, in which the linear polymers 
(EDA-Succ)ig carboxylate and (EDA-Succ)6-amide have been joined to the protein 
backbone through oxime linkages at positions 24 and 126, respectively. The 

20 sequence of the full-length SEP-1 (1-166) is: 

APPRLICDSR VLERYLLEAK EAEK^^'ITTGCA EHCSLNEKIT 

VPDTKVNFYA WKRMEVGQQA VEVWQGLALL SEAVLRGQAL 

LVKSSQPW\|;P LQLHVDKAVS GLRSLTTLLR ALGAQKv|/AIS 

PPDAAK^^'AAPL RTITADTFRK LFRVYSNFLR GKLKLYTGEA 
25 CRTGDR (SEQ ID NO: 2) 

where T denotes an non-native ammo acid residue consisting of a cysteine 
that is carboxymethylated at the sulfhydryl group, and where K^^ denotes a non- 
native lysine that is chemically modified at the e-amino group with an oxime linker 
group coupled to a designated water-soluble polymer through an oxime bond. 
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In contrast to SEP-1-L30, the SEP-1-L26 construct was designed to bear a 
smaller and uncharged water-soluble polymer attached at position 126. The polymer 
attached at position 24 was the same as in SEP-1-L30. Assembly of the full-length 
product was as described in Example 2. 

5 A. Oximation of GRFN1776 with GRBNP6 and oximation of 

GRFN1711 with GRENP32 

Oxime fomiation was performed to attach water-soluble polymers bearing an 
aminooxyacetyl group to peptides carrying a ketone carboijiyl group. To accomplish 
this, the following peptide segments were synthesized: 

10 Segment SEP-1:4 (GRPN 1776; composed of residues 1 17-166 of SEQ BD NO:2): 
CAISPPDAAK AAPLRTITAD TFRKLFRVYS NFLRGKLKLY 
TGEACRTGDR-carboxylate (where Lys^"^ is modified with a levulinic 
acid residue at the e-amino group, and where Cys^ is Acm protected) 

Segment SEP-1:1 (GRFN 1711, composed of residues 1-32 of SEQ ID NO:2): 
15 APPRLICDSR VLERYLLEAK EAEKITTGCA EH-thioester 

(where Lys^"^ is modified with a levulinic acid residue) 

Segment SEP-1:1 (GRFN 1711) was synthesized on a thioester-generating 
resm, and Segment SEP-1:4 (GRFN 1776) on a -0CH2-Pam-resin as in Example 1. 

20 Lysines 24 and 126 of these two peptide segments were initially protected with an 
Fmoc group at the e-amino group. After completion of the chain assembly, the 
Fmoc-bearing amino groups were deprotected following standard Fmoc deprotection 
procedures and modified by attachment of levulinic acid to each peptide resin, 
respectively, following standard coupling protocols. The peptides were then 

25 deprotected and sunultaneously cleaved from the resin support according to standard 
Boc-chemistry procedures as in Example 1. TTie peptides were separately purified 
by preparative C4 reverse-phase HPLC. For each peptide, fractions containing pure 
peptide were idraitified using ES-MS, pooled and lyophilized for subsequent 
ligation. 
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The water soluble polymer (EDA-Succ)i8 carboxylate (GRFNP32) was 
assembled on a Sasrin carboxy-generating resin following standard protocols (Rose, 
K. et aU U.S. Patent Application Serial No. 09/379^97; Rose, et al., J Am Chem 
Soci\999) 121: 7034). The water soluble polymer (EDA-Succ)6-amide (GRFNP6) 
5 was assembled on a Sieber amide-generating resin following standard protocols 
(Rose, K. et al, U.S, Patent Application Serial No. 09/379,297; Rose, et al., J Am 
Chem Soc.{l999) 121: 7034). An arainooxyacetyl (AoA) moiety was attached to the 
N-terminal amino group of each resin-bound polymer by coupling a fivefold excess 
of activated Boc-aminooxyacetic acid. The two polymer chains were separately 
10 cleaved from the respective resin supports using classic Fmoc-chemistry 
procedures. Each polymer chain was purified by preparative reverse phase HPLC. 
For each polymer, fractions containing pure polymer were identified using ES-MS, 
pooled and lyophilized for subsequent ligation. 

Segment SEP-1:4 and GRFNP6 were jointly dissolved at an equimolar ratio 
15 m 50% aqueous acetonitrile containing 0.1% TFA. The solution was then 
lyophilized. The dried powder was dissolved and the polymer-modified peptide 
separated from unmodified peptide and unreacted polymer by preparative gradient 
C4 reversfe-phase HPLC. Fractions containing the desired oximated product SEP- 
1:4 + GP6 were identified by ES-MS and pooled and lyophilized. 
20 Segment SEP-1 : 1 and GRFNP32 were jomtly dissolved at an equimolar ratio 

in 50% aqueous acetonitrile containmg 0.1% TFA. The solution was then 
lyophilized. The dried powder was dissolved and the polymer-modified peptide 
separated from unreacted polymer by preparative gradient C4 reverse-phase HPLC. 
Fractions containing the desired oximated product SEP-1:1 + GP32 were identified 
25 by ES-MS and pooled and lyophilized. 

B. Synthesis of Synthetic Erythropoiesis Stimulating Protein SEP-1- 
L26 

SEP-1-L26 was synthesized in solution from four polypeptide segments: 
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Segment SEP:1:1+GP32 (GRFN 1711 + GRFNP32, corresponding to residues 1-32 
ofSEQIDNO:2): 

APPRLICDSR VLERYLLEAK EAEK°*ITTGCA EH-thioester (where Lys^* 
is modified at the e-amino group with a levulinic oxime linker group that is 
5 coupled to GRFNP32 through a levulinic-aminooxyacetyl (Lev-AoA) oxime 

bond denoted K^'') 

Segment SEP-1:2 (GRFN 1712, corresponding to residues 33-88 of SEQ ID NO:2): 
CSLNEKIT VPDTKVNFYA WKRMEVGQQA VEVWQGLALL 
SEAVLRGQAL LVKSSQPW-thioester (where Cys^^ is Acm protected) 

10 Segment SEP-1:3 (GRFN 1713, corresponding to residues 89-116 of SEQ ID 
N0:2): 

CP LQLHVDKAVS GLRSLTTLLR ALGAQK-thioester (where Cys*' is 
Acm protected) 

Segment SEP:1:4-KjP6 (GRFN 1776 + GRFNP6, corresponding to residues 117-166 

15 ofSEQroNO:2): 

CAIS PPDAAK^AAPL RTITADTFRK LFRVYSNFLR GKLKLYTGEA 
CRTGDR-carboxylate (where Lys'^^ is modified at the e-amino group with a 
levulinic oxime luiker group that is coupled to GRFNP6 through a levulinic- 
aminooxyacetyl (Lev-AoA) oxime bond denoted K"** and where the C- 

20 tenninal cysteine [i.e. Cys^^^] carries a picolyl (pico) protecting group) 

Synthesis of additional peptides, ligation reactions, carboxymethylation, 
protecting group removal reactions, folding and purification were perfonned as 
described in Examples 1 and 2 to yield full-length, folded SEP-1-L26. An analytical 
C4 reverse-phase HPLC chromatogram and an ES-MS spectrum of the folded 
25 protem product as well as a CD spectrum demonstrated the presence of folded 
protein. 
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Example 4 

Synthesis Of Synthetic Erythropoiesis Stimulating Protein SEP-l-BSO 

A fourth synthetic erythropoiesis stimulating protein (designated SEP-1-B50) 
was synthesized. The amino acid sequence of the fiill-length SEP-1-B50 is the same 
5 asthatofSEP-l-L30: 

APPRLICDSR VLERYLLEAK EAEK^'^'ITTGCA EHCSLNEKIT 

VPDTKVNFYA WKRMEVGQQA VEVWQGLALL SEAVLRGQAL 

LVKSSQPW\i/P LQLHVDKAVS GLRSLTTLLR ALGAQK\|/AIS 

PPDAAK***AAPL RTITADTFRK LFRVYSNFLR GKLKLYTGEA 
10 CRTGDR (SEQ ID NO: 2) 

where T denotes an non-native ammo acid residue consisting of a cysteine 
that is carboxymethylated at the sulfhydryl group, and where K**" denotes a non- 
native lysuie that is chemically modified at the e-amino group with an oxime Imker 
group coupled to a designated water-soluble polymer through an oxime bond. 
15 However, the protein was derivatized with a branched polymer construct 

having four linear (Succ-EDA)i2 moieties rather than the linear (Succ-EDA)i8 
polymer of SEP-1-L30. Derivatization was accomplished via oxime linkages. 
Assembly of the full-length product was as described in Example 2. 

A. Synthesis of Template 6RFNP17 Carrying Multiple Thiol 
20 Groups 

The template GRFNP17 was synthesized manually on an amide generating 
(4-methyl)benzhydrylamine(MBHA)-resin on a 0.4mmol scale. Fmoc-Lys(Boc)-OH 
was coupled using standard coupling protocols (SchnSlzer, M., Int J Pept Protein 
Res. (1992) 40:180-93). 2.1mmol amino acid, 10% DIEA in 3.8 ml 0.5M HBTU 

25 was used; i.e. 5-fold excess of ammo acid. After removal of the Fmoc protecting 
group, Fmoc-Lys(Fmoc)-OH was coupled using standard amino acid coupling 
protocols (2.1 mmol ammo acid, 10% DIEA in 3.8 ml 0.5M HBTU; i.e. 5-fold 
excess amino acid). After a second Fmoc removal step, Fmoc-Lys(Fmoc)-0H was 
coupled using standard amino acid coupling protocols (4.2 mmol amino acid, 10% 

30 DIEA in 7.6 ml 0.5M HBTU; i.e. 5-fold excess amino acid relative to free amine). 
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After a final Fmoc deprotection step, a five-fold excess (relative to fiw amines) of 
S-acetyl thiogiycolic acid pentafluorophenyl ester (SAMA-oPfy) in DMF was 
coupled for 30 minutes. The Boc protecting group of the side chain of the C- 
temiinal lysyl residue was removed by two times one minute batch washes with neat 

5 TFA, followed by neutralization of the resin by washing with 10% DIEA in DMF. 2 
mmol Boc-aminooxyacetic acid and 2 mmol N-hydroxysuccinimide (NHS) were 
dissolved m 3 ml DMF. After addition of 2 nunol DIG (diisopropylcarbodiimide), 
the acid was activated for 30-60 minutes. The solution was added to the neutralized 
resin and coupled 1 hr. Finally, the S-linked acetyl groups were removed with 20% 

10 piperidine in DMF for 30 minutes. The template was deprotected and 
simultaneously cleaved from the resin support using HF/p-cresol accordmg to 
standard Boc-chemistry procedures m the presence of cysteine as a scavenger for 
free aldehyde (Schnolzer, M., Int J Pept Protein Res, (1992) 40:180-93). The 
recovered polyamide in 50% B [i.e. 50% aqueous acetonitrile containing 0.1%TFA] 

15 (aldehyde free) was frozen and lyophilized. For purification, the template crude 
product was dissolved in 2 ml 50% B, and 100 ml 100% A [i.e. 0.1% TFA in water] 
was added to dilute the sample (Avoid guanidmium chloride or acetate addition, 
since the addition of aldehyde is guaranteed). The template was loaded onto a C4 
preparative reverse-phase HPLC column equilibrated at T = 40''C at 3% B. Salts 

20 were eluted isocratically and the desfred template, GRFNP17 purified in a linear 
gradient. Fractions containing the desired product were identified by ES-MS, pooled 
and lyophilized. 

B. Synthesis of Branched Water-Soluble Polymer GRNP29 
GRFNP29, a branched (EDA-Succ)i2 polymer of 15kDa molecular weight 
25 was synthesized by tiiioether-generating ligation of purified thiol-containing 
template GRFNP17 and a linear polymer GRFNP31, Br-acetyl-(EDA-Succ)i2 
carboxylate , where GRFNP31 was synthesized on a Sasrin caiboxy-generating resin 
following standard protocols (Rose, K. et al, U.S. Patent Application Serial No. 
09/379^97; Rose, et al, J Am Chem Soc.(l999) 121: 7034). 
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A 1.3x molar excess (over total thiols) of the purified GRFNP31, Br- 
acetylated (EDA-Succ)i2, and purified thiol-containing template GRFNP17 were 
jointly dissolved 0.1 M Tris -HCl/ 6 M guanidmium chloride, pH 8.7 at -10 mM 
concentration. After dissolution, the solution was diluted threefold (v/v) with 0.1 M 
5 Tris -HCl, pH 8.7 buffer. The ligation mixture was stirred at room temperature and 
the reaction monitored by reversed-phase HPLC and ES/MS. Additional GRFNP31 
reactant was added on an as-needed basis until the desired reaction product was the 
major product For workup, 3x (v/v to ligation mix) O.IM acetate / 6 M 
guanidinium chloride, pH 4 was added, and the solution was loaded onto a 
10 preparative C4 reverse-phase HPLC column, and purified with a linear gradient. 
Fractions containing pure GRFNP29 construct were identified using ES-MS, pooled 
and lyophilized. 

C. Oximation of GRFN1776 and GRFN1711 with GKFNP29 

Segments SEP-1;4, and Segment SEP-1:1 were synthesized as described in 
1 5 Example 2. Segment SEP-1 :4 and GRFNP29 were jointly dissolved at an equunolar 
ratio in 50% aqueous acetonitrile containmg 0.1% TFA . The solution was 
lyophilized. The dried powder was loaded onto a preparative reverse-phase HPLC 
column (1 inch diameter). The polymer-modified peptide was separated from 
unmodified peptide and unreacted polymer by preparative gradient C4 reverse-phase 
20 HPLC. Fractions containmg the desired oximated product SEP:1:4+GP29 were 
identified by ES-MS and pooled. 

Segment SEP-1:1 and GRFNP29 were jointly dissolved at an equimolar ratio 
in 50% aqueous acetonitrile containing 0.1% TFA. The solution was fi-ozen and 
lyophilized. The dried powder was dissolved in 50% aqueous acetonitrile containing 
25 0.1% TFA and loaded onto a preparative GPC (gel permeation chromatography) 
column (1 inch diameter). The polymer-modified peptide was separated fi*om 
unmodified peptide and unreacted polymer by isocratic elution. Fractions containing 
the desired oximated product SEP-1 :1-KjP29 were identified by ES-MS and pooled. 
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D. Synthesis Of Synthetic Erythropoiesis Stimulating Protein SEP- 
1-BSO 

SEP-1-B50 (SEQ ID NO:2) was synthesized in solution from four 
polypeptide segments: 

5 

Segment SEP-1:1+GP29 (GRFN 1711 + GRFNP29, corresponding to residues 1-32 
ofSEQIDNO:2): 

APPRLICDSR VLERYLLEAK EAElC^'^ITTGCA EH-thioester (where Lys^^ 
is modified at the s-ammo group with a levulinic oxime linker group that is 
10 coupled to GRFNP29 through a levulinic-aminooxyacetyl (Lev-AoA) oxime 

bond denoted K°*) 

Segment SEP-1:2 (GRFN 1712, corresponding to residues 33-88 of SEQ ID N0:2): 
CSLNEKIT VPDTKVNFYA WKRMEVGQQA VEVWQGLALL 
SEAVLRGQAL LVKSSQPW-thiocster (where Cys^^ is Acm protected) 

15 Segment SEP-1:3 (GRFN 1713, corresponding to residues 89-116 of SEQ ID 
NO:2): 

CP LQLHVDKAVS GLRSLTTLLR ALGAQK-thioester (where Cys*^ is 
Acm protected) 

Segment SEP11:4+GP29 (GRFN 1776 + GRFNP29, corresponding to residues 117- 

20 166ofSEQIDNO:2): 

CAIS PPDAAK°*AAPL RTITADTFRK LFRVYSNFLR GKLKLYTGEA 
CRTGDR-carboxylate (where Lys^^^ is modified at the e-amino group with a 
levulinic oxime linker group that is coupled to GRFNP29 through a 
levulinic-aminooxyacetyl (Lev-AoA) oxime bond denoted K°'^ and where the 

25 C-termmal cysteine carries a picolyl (pico) protecting group) 

Synthesis of additional peptides, ligation reactions, carboxymethylation, 
protecting group removal reactions, folding and purification were performed as 
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described in Examples 1 and 2, except that purification was on a Resource Q 
column, to yield full-length, folded SEP-1-B50 (SEQ ID NO: 2). An analytical C4 
reverse-phase HPLC chromatogram and an ES-MS spectrum of the folded protein 
product SEP-1-B50 as well as a CD spectrum demonstrated the presence of folded 
5 protein. 

Example 5 

Synthesis Of Synthetic Erythropoiesis Stimulating Protein SEP-3-L42 

A fifth synthetic erythropoiesis stimulating protein (designated SEP-3-L42) 
10 was synthesized. The amino acid sequence of the full-length SEP-3 protem is: 

APPRLICDSR VLERYLLEAK EAECITTGCA EHCSLNECIT 

VPDTKVNFYA WKRMEVGQQA VEVWQGLALL SEAVLRGQAL 

LACSSQPWEP LQLHVDECAVS GLRSLTTLLR ALGAQKEAIS 

PPDAACAAPL RTITADTFRK LFRVYSNFLR GKLKLYTGEA 
15 CRTGDR (SEQ ID NO: 3) 

The cysteine residues in positions: 24, 38, 83, and 126 were modified wilh 
maleimide-fimctionalized (EDA-Succ)i8 (GRPNP32) polymer units (via a Michael 
addition reaction) to form SEP-3-L42. 

Jn detail, SEP-3 was synthesized in solution fi'om four polypeptide segments: 

20 Segment SEP-3:1 (GRFN 1747, corresponding to residues 1-37 of SEQ ID NO:3): 
APPRLICDSR VLERYLLEAK EAECITTGCA EHCSLNE-thioester 
(where Cys^ Cys^^ and Cys^^ are Acm protected) 

Segment SEP-3:2 (GRFN 1774, corresponding to residues 38-82 of SEQ ID NO:3): 
CIT VPDTKVNFYA WKRMEVGQQA VEVWQGLALL 
25 SEAVLRGQAL LA-thioester (where Cys^^ is side chain protected with the 

Acm group) 



Segment SEP-3:3 (GRFN 1749, corresponding to residues 83-125 of SEQ ID 
NO:3): 
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CSSQPWEP LQLHVDKAVS GLRSLTTLLR ALGAQKEAIS 
PPDlVA-thioester (where Cys*^ is side chain protected with the Acm group) 

Segment SEP-3:4 (GRFN 1750, corresponding to residues 126-166 of SEQ ID 
NO:3): 

5 CAAPL RTITADTFRK LFRVYSNFLR GKLKLYTGEA 

CRTGDR-carboxylate (where Cys^^^ is Pbo [i.e., 4-(CH3S(0)-)benzyl- ] 
protected) 

Peptide synthesis. The peptides SEP-3:1 and SEP-3:2 and SEP-3:3 were 
synthesized on a thioester-generating resin by the in situ neutralization protocol for 

10 Boc chemistry SPPS, using established side-chain protection strategies as described 
in Example 1, with changes in protecting group strategy as noted in the specific 
peptides above. Segment SEP-3:4 was synthesized analogously on a -0CH2-Pam- 
resin. The peptides were deprotected and simultaneously cleaved from the resin 
support as described in Example 1. The resulting peptides described above were 

15 purified by preparative RP-HPLC. Fractions containing pure peptide were identified 
using ES-MS, pooled and lyophilized for subsequent ligation. 

\ Step 1: Ligation #L Segment SEP-3:4 and segment SEP-3:3 were dissolved 

in TFE at 15 mM concentration. Saturated phosphate buffer (pH 7.5) containmg 6 
M guanidinium chloride and 1% thiophenol was added, resultmg in a clear solution 

20 of the peptide segments. After ligation, the ligation mix (defined as 1 volume) was 
added to 2 volumes of a solution of {2 ml TFE, 6 ml 6 M guanidinium chloride, 100 
mM phosphate containing 25% p-mercaptoethanol} and incubated for 20 minutes. 
Hie solution was acidified with a solution of 15 mg/ml TCEP in glacial acetic acid, 
loaded onto a preparative C4 reverse-phase HPLC colunm and purified with a linear 

25 gradient. Fractions contaming the desired ligated product SEP-3:Acm+SEP- 
3:3+SEP-3:4 were identified by ES-MS and pooled. 

Step 2: Acm-removal #1 For Acm removal, the aqueous acetonitrile solution 
containing the pooled flections of SEP-3:Acm+SEP'3:3+SEP-3:4 was diluted Ix 
with HPLC grade water, and solid urea was added for a final concentration of 2 

30 molar. A threefold molar excess (relative to the total expected cysteine 
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concentration) of a 30 mg/ml Hg(acetate)2 solution in 3% aqueous acetic acid was 
added and the solution was stirred for one hour. The solution was then made 20% in 
p-mercaptoethanol, loaded onto a semi-preparative C4 reverse-phase HPLC column 
and purified with a step gradient. Fractions containing the desired ligated product 
5 SEP-3 :3+SEP-3 :4 were identified by ES-MS and lyophilized overnight 

Step 3: Ligation #2 Equal amounts of SEP-3 :3+SEP-3:4 and SEP-3.2 were 
jointly dissolved in neat TFE trifluoroethanol at 15 mM concentration. 250 mM 
Phosphate buffer (pH 7.5) containing 6 M guanidinium and 1% thiophenol was 
added, resulting in a clear solution of the peptide segments. After one day of 

10 ligation, the ligation mix (defined as 1 volume) was added to 2 volumes of a solution 
of 10 ml TFE, 10 ml p-mercaptoethanol, 10 ml piperidme and 20 ml 6M 
guanidinium, pH 4, and incubated for 20 minutes to remove any remaining 
. protecting groups. The solution was acidified with a solution of 15 mg/ml TCEP in 
20% aqueous acetic acid, loaded onto a preparative C4 reverse-phase HPLC column 

15 and purified with a linear gradient. Fractions containing the desired ligated product 
SEP-3:Acm4-SEP-3:2+SEP-3:3+SEP-3:4 were identified by ES-MS and lyophilized 
overnight. 

Step 4 Acm removal For Acm removal, the aqueous acetonitrile solution 
containing tiie pooled fractions of SEP-3 :Acm+SEP-3:2+SEP-3:3+SEP-3:4 was 

20 diluted Ix with HPLC grade water, and solid urea added for a final concentration of 
2 molar. A threefold molar excess (relative to the total expected cysteine 
concentration) of a 30 mg/ml Hg(acetate)2 solution in 3% aqueous acetic acid was 
added and the solution stirred for one hour. The solution was then made 20% in p- 
mercaptoetiianol, loaded onto a C4 semi-preparative reverse-phase HPLC colunm 

25 and purified with a step gradient. Fractions containing the desired ligated product 
SEP-3:2+SEP-3:3+SEP-3:4 were identified by ES-MS and lyophilized ovemi^t. 

Step 5: Ligation #3 Equal amounts of SEP-3:2+SEP-3:3+SEP-3:4 and 
SEP:3:1 were jomtiy dissolved in neat TFE at 15 mM concentration. 250 mM 
Phosphate buffer (pH 7.5) containing 6 M guanidinium and 1% thiophenol was 

30 added, resultmg in a clear solution of the peptide segments. After one day of 
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ligation, the ligation mix (defined as 1 volume) was added to 2 volumes of a solution 
of 10 mi TFE, 10 ml P-mercaptoethanol, 10 ml piperidine and 20 ml 6M 
guanidinium, pH 4, and incubated for 20 minutes to remove any remaining 
protecting groups. liie solution was acidified with a solution of 15 mg/ml TCEP in 
5 20% aqueous acetic acid, loaded onto a preparative C4 reverse-phase HPLC column 
and purified with a linear gradient. Fractions containing the desired ligated product 
SEP-3:l+SEP-3:2+SEP-3:3+SEP-3:4 were identified by ES-MS and lyophilized 
overnight. 

Step 6: Attachment of the Polymer GRFNP32. A maleimide-functionalized 

10 linear (EDA-Succ)i8 polymer called GRFNP32-maleimide was prepared by 
functionalizing GRFNP32 with BMPS (3-maIeimido propionic acid NHS ester, 
Pierce, USA) following the manufacturers protocols to form a maleimide- 
fimctionalized (EDA-Succ)i8 polymer [i.e., maleimide-(EDA-Succ)i8]. SEP- 
3:l+SEP-3:2+SEP-3:3+SEP-3:4 was dissolved in the minimum amount of TFE 

15 required. A threefold excess of GRFNP32-maleimide was dissolved in 6M 
guanidinium chloride, 100 mM phosphate, pH 7.5 and added to the TFE solution. 
• The progress of the Michael addition reaction was followed by analytical reverse- 
phase HPLC. After the reaction was complete, the solution was loaded onto a 
preparative C4 reverse-phase HPLC column and purified with a linear gradient. 

20 Fractions containing the desired polymer-modified product SEP3:Acm+SEP- 
3:l+SEP-3:2+SEP-3:3+SEP-3:4+pPEG [i.e. the ligated fiiU-lengtti 166 residue 
polypeptide cham with four copies of GRFNP32 attached to the side chain thiols of 
Cys^\ Cys^^ Cys^ and Cys"* , and tiius also called SEP3:Acm+SEP-3:l+SEP- 
3:2+SEP-3:3+SEP-3:4-KjP32], were identified by ES-MS and lyophilized overnight 

25 Step 7: Pbo removal For Pbo reduction, the lyophilized powder of 

SEP3:Acm+SEP-3:l+SEP-3:2+SEP-3:3+SEP-3:4+GP32 was dissolved in neat TFA 
containing 5% ethanedithiol. The Pbo group was then cleaved by addition of 10% 
thioanisole and 15% bromotrimetbylsilane for 30 minutes. The solution was dried in 
a rotator-evaporator and taken up in aqueous acetonitrile containing 0.1% TFA. The 

30 resulting solution was loaded onto a semi-preparative reverse-phase HPLC column 
and purified with a stq) gradient Fractions containing the desired Cys^^^-deprotected 
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product SEP3:Acm+SEP-3:l+SEP-3:2+SEP-3:3+SEP-3:4+GP32-Pbo were 
identified by ES-MS and lyophiiized overnight 

Step 8: Acm removal For final Acm removal fi-om the side chains of Cys^, 
Cys^^ and Cys^^, the aqueous acetonitrile solution containing the pooled fractions of 
5 SEP3:Acm+SEP-3:l+SEP-3:2+SEP-3:3+SEP-3:4+GP32-Pbo was diluted Ix with 
HPLC grade water^ and solid urea was added for a final concentration of 2 molar. A 
threefold molar excess (relative to the total expected cysteine concentration) of a 30 
mg/ml Hg(acetate)2 solution in 3% aqueous acetic acid was added and the solution 
was stirred for one hour. The solution was then made 20% in p-mercaptoethanol, 

10 loaded onto a semi-preparative C4 reverse-phase HPLC column and purified with a 
step gradient. Fractions containing the desired ligated, polymer-modified product 
SEP-3 (1-166) were identified by ES-MS and lyophiiized overnight. 

Step 9 Folding: Full-length ligated, polymer-modified peptide SEP-3 (1- 
166) was dissolved in 200 mM Tris buffer (pH 8.7) containing 6 M guanidinium 

15 chloride and 20% TFE and a ten-fold molar excess (relative to Cys residues in SEP- 
3) of cysteine. This solution was dialyzed overnight against a solution of 200 mM 
Tris buffer (pH 8.7) containing 3 M guanidinium chloride at room temperature. The 
solution was then dialyzed against a solution of 200 mM Tris buffer (pH 8.7) 
containing 1 M guanidinium chloride for 4 hours at 4 and finally against 10 mM 

20 phosphate buffer (pH 7.0) for 4 hours at 4*C to yield the final folded product. 
Folding was verified by electrospray ES-MS and CD spectrometry. 

Step 10 Purification: The folded polypeptide was concentrated 5x m 
centricon concentrator vials and loaded on to Resource S cation exchange column 
equilibrated at 10 mM phosphate, pH 7.0. The folded protein was eluted in a linear 

25 salt gradient to 500 mM NaCl in 10 minutes. Fractions containing the desired folded 
product SEP-3- L42 were identified by SDS-PAGE , and fix)zen and stored at -80 "C. 
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Example 6 

Bioactivity Assay of Synthetic Erythropoiesis Stimulating Proteins 

The bioactivity of folded synthetic erythropoiesis stimulating proteins, SEP- 
0, SEP-1-L26, SEP-1-L30, SEP-1-B50, and SEP-3-L42 was determined using UT/7 

5 and 32D 103197 cell lines, in factor-dependent cell-line proliferation assays using 
conmiercial recombinant erythropoietin as a control standard. UT-7 is a human 
megakaryoblastic leukemia cell line with absolute dependence on one of interleukin- 
3, granulocyte-macrophage colony-stimulating factor (GM-CSF), or erythropoietin 
(EPO) for growth and survival (Miura Y, et al., Acta Haematol (1998) 99:180-184); 

10 Komatsu N, et al., Cancer Res. (1991) 51:341-8). 32D 103197 is a murine 
hemopoietic cell line (Metcalf, D. Int J Cell Cloning (1992) 10:116-25). 

Stock-solutions of the SEP constructs were made in Iscove's modified 
Dulbecco*s mediuni (IMDM), 10% FBS (Fetal bovine serum), glutamine and 
Penstrep, and serial 2x dilutions of these stock solutions were added to multi-well 

15 plates to which human UT/7 EPO cells at a concentration of 5000 cells/50 ^il were 
added. The plates were incubated at 37 'C in the presence of 5% CO2 and monitored 
daily for growth. After four days, 20 jil 2.5 mg/ml MTT (methylthiazol tetrazolium) 
in PBS (phosphate buffered saline) was added and the plates were incubated for four 
hours. 150 pJ IPA was added and the absorbance of each well was read at 562 nm. 

20 The ED50 (effective dose to reach 50% of maximum effect) values for the SEP 
compounds was determined and compared to that of CHO (Chinese hamster ovary) - 
cell produced rhEPO (recombinant human erythropoietin). The results fix)m these 
experiments demonstrated that all of the synthetic erythropoiesis stimulating proteins 
exhibited bioactivity. ED50 results for SEP-0, SEP-1-L26, SEP-1-L30, and SEP-1- 

25 B50 are shown in Table VI. 
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Table VI 



Erythropoiesis 
Stimulating Protein 


In Vitro EDSO Values (pM) 


TJT-7 
(Human) Cells 


32D 103197 
(Mouse) Cells 


SEP-0 


1,570 


863 


SEP-1-L26 


46.5 


100.8 


SEP-1-L30 


71.5 


182.5 


SEP-1-B50 


182 


6200 


AEPO 


32.5 


136.3 



While the invention has been described in connection with specific 
embodiments thereof, it will be understood that it is capable of fiirther modifications 
and this application is intended to cover any variations, uses, or adaptations of the 
5 invention following, in general, the principles of the invention and including such 
departures from the present disclosure as come within known or customary practice 
within the art to which the invention pertains and as may be applied to the essential 
features hereinbefore set forth. 
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Whatlsaaimed Is : 

1. A synthetic protein containing a pseudo-amino acid residue whose side chain 
has the formula: -S- Raa, where Raa is an optionally substituted terminal 
portion of a ribosomally-specified amino acid side chain, or an analog of said 

5 terminal portion of a ribosomally-specified amino acid side chain. 

2. The synthetic protein of claim 1, wherein said side chain -S-Raa has the 
same chain length as the side chain of said ribosomally-specified amino acid. 

3. The synthetic protein of claim 1, wherein said side chain Raa has a 
greater chain length than the side chain of said ribosomally-specified amino 

10 acid. 

4. The synthetic protein of claim 1, wherein said protein has a biological 
activity possessed by a ribosomally-specified bioactive protein. 

5. The synthetic protein of claim 4, wherein said side chain -S- Raa of said 
pseudo amino acid residue mimics the structure or function of a ribosomally- 

15 specified amino acid at a corresponding position in said bioactive protein. 

6. The synthetic protein of claim 4, wherein said side chain -S- Raa of said 
pseudo amino acid residue alters the structure or fiinction of a ribosomally- 
specified amino acid at a corresponding position in said bioactive protein. 

7. The synthetic protein of claim 4, wherein said synthetic protein has a 
20 monomer molecular weight greater than about 25 kDa. 

8. The synthetic protein of claim 4, wherein said ribosomally produced 
bioactive protein contains a cysteine residue. 

9. The synthetic protein of claim 8, wherein said pseudo amino acid residue is 
at a position other than one correspondmg to the position of said cysteine 

25 residue in said ribosomally produced bioactive protein. 
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10. The synflietic protein of claim 1, wherein a plurality of amino acid residues 
of said protein are boimd to adjacent amino acid residues by an amide bond. 

1 1 . The synthetic protein of clakn 10, wherein each amino acid residue is bonded 
to its adjacent amino acid residues by amide bonds. 

5 12. The syntibietic protein of claim 1, wherem said protein is composed of a 
plurality of amino acid residues wherein at least one of such amino acid 
residues is bonded to an adjacent amino acid residue by a non-amide bond. 

13. The synthetic protein of claim 12, wherein said non-amide bond is selected 
&om tihe group consisting of a thioester bond, a thioether bond, and an oxime 

10 bond. 

14. The synthetic protein of claim 1, wherein said pseudo-amino acid residue is a 
D-configuration amino acid residue. 

15. The synthetic protein of claim 1, wherein said pseudo-amino acid residue is 
an L-configuration amino acid residue. 

15 16. The synthetic protein of claim 1, wherem said pseudo-amino acid residue is 
selected from the group consisting of a pseudo-arginine; a pseudo- 
asparagine; a pseudo-aspartate; a pseudo-dopamine; a pseudo-giutamate; a 
pseudo-glutamine; a pseudo-histidine; a pseudo-isoleucine; a pseudo-leucine; 
a pseudo-lysine; a pseudo-methionine; a pseudo-phenyalanine; a pseudo- 

20 serine; a pseudo-threonine; a pseudo-tiyptophan; a pseudo-tyrosine; and a 

pseudo-valine. 

17. The synthetic protein of clahn 16 wherein said pseudo-amino acid residue is 
a pseudo-giutamate. 

18. The synthetic protein of claim 4, wherein said ribosomally produced 
25 bioactive protein is a mammalian protein. 
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The synthetic protein of claim 18, wherein said mammalian protein is 
selected from the group consisting of a human, simian, bovine, murine, 
porcine, ovine, and equine protein. 

The synthetic protein of claim 18, wherein said manunalian protein is a 
human protein. 

The synthetic protein of any of claims 19-20, wherem said protein has a 
bioactivity selected of a protein receptor or fragment thereof, of a protein 
receptor ligand or fragment thereof, or of a cytokine. 

The synthetic protein of claim 21, wherein said synthetic bioactive protein 
has a bioactivity of a cytokine. 

The synthetic protein of claim 22, wherein said cytokine is selected from the 
group consisting of an interleukin, a lymphokine, a RANTES protein, an 
erythropoiesis stimulating protein, tumor necrosis factor (TNF), an 
interferon, a growth factors and a single peptide hormone. 

The synthetic protein of claim 23, wherein said cytokine is an erythropoiesis 
stimulating protein. 

The synthetic protein of claim24, wherein said erythropoiesis stimulating 
protein is erythropoietin. 

The synthetic protem of claun 25, wherein said synthetic bioactive protein is 
selected from the group consisting of SEP-0, SEP-1, and SEP-3. 

The synthetic protein of claun 22, wherein said cytokine is a RANTES 
protein. 

The synthetic protein of claim 22, wherein said c^okme is a growth factor. 
The synthetic protein of claim 28, wherein said growth factor is G-CSF. 
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The synthetic protein of claim 1, wherein said synthetic protein comprises 
one or more amino acid residues that are modified by one or more polymer 
adducts. 

The synthetic protein of claim 4, wherein said ribosomally produced 
bioactive protem is glycosylated at one or more glycosylation sites. 

The synthetic protein of claim 31, wherein said synthetic protein comprises 
one or more amino acid residues that are modified by one or more polymer 
adducts at amino acid residue(s) that correspond to at least one of said 
glycosylation sites of said ribosomally produced bioactive protein. 

A molecularly homogeneous pharmaceutical composition comprising a 
synthetic protein, wherein said protein possesses a biological activity that 
mimics a biological activity associated with a ribosomally-specified bioactive 
mammalian protein, and has a monomer molecular weight greater than about 
25 kDa, and contains a pseudo-ammo acid residue whose side cham has the 
formula: -S- Raa» where Raa is selected from the group consisting of an 
optionally substituted termmal portion of a ribosomally-specified amino acid 
side chain, or an analog thereof 

The pharmaceutical composition of claim 33, wherein said composition 
comprises a mixture of at least two of said molecularly homogeneous 
pharmaceutical compositions. 

The phannaceutical composition of claim 33, wherein said ribosomally- 
specified bioactive mammalian protem is selected from the group consisting 
of a human, simian, bovine, murine, porcme, ovine, and equine protein. 



36, The pharmaceutical composition of claim 33, wherem said ribosomally- 
25 specified bioactive mammalian protein is a human protein. 
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37. The pharmaceutical composition of claim 36, wherein said synthetic protein 
has a biological activity of a protein receptor or fragment thereof of a protein 
receptor ligand or fragment thereof, or of a cytokine. 

38. The pharmaceutical composition of claim 37, wherein said synthetic protem 
5 has a biological activity of a cytokine. 

39. The pharmaceutical composition of claim 38, wherein said cytokine is 
selected from the group consisting of an interleukin, a RANTES protein, a 
lymphokine, an erythropoiesis stimulating protein, tumor necrosis factor 
(TNF), an interferon, a growth factor and a single peptide hormone. 

10 40. The pharmaceutical composition of claim 39, wherein said cytokine is an 
erythropoiesis stimulating protem. 

41. The pharmaceutical composition of claim 40, wherem said erythropoiesis 
stimulating protein is erythropoietin. 

42. The pharmaceutical composition of claim 41, wherein said synthetic protein 
1 5 is selected from the group consisting of SEP-0, SEP- 1 , and SEP-3. 

43. The pharmaceutical composition of claim 38, wherein said cytokine is a 
RANTES protein. 

44. The pharmaceutical composition of claim 39, wherein said cytokine is a 
growth frictor. 

20 45. The pharmaceutical composition of claim 44, wherein said growth factor is 
G-CSF. 

46. A meftod of treating a human disease or condition that comprises 
administering to an individual in need of such treatment an effective amount 
of a pharmaceutical composition comprising one or more molecularly 
25 homogeneous pharmaceutical compositions each comprising a synthetic 

protein, wherein said synthetic protem has a monomer molecular weight 
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greater than about 25 kDa, and contains a pseudo-amino acid residue whose 
side chain has the formula: 7-S- Raa» where Raa is selected from the group 
consisting of an optionally substituted terminal portion of a ribosomally- 
specified amino acid side chain, or an analog thereof; said synthetic protein 
5 possessing a biological activity that mimics a biological activity of a 

rlbosomally-specified bioactive human protein receptor or fragment thereoj^ 
protein receptor ligand or fragment thereof, or a cytokme. 

47. The method of claim 46, wherein said synthetic protein has a biological 
activity of a cytokine. 

10 48. The method of claim 47, wherem said cytokine is selected from the group 
consisting of an interleukin, a lymphokine, a RANTES protein, an 
erythropoiesis stimulating protein, tumor necrosis factor (TNF), an 
interferon, a growth factor and a single peptide hormone. 

49. The method of claim 48, wherein said cytokine is an erythropoiesis 
1 5 stimulating protein. 

50. The method of claim 49, wherein said erythropoiesis stimulating protein is 
erythropoietin. 

51. The method of claim 50, wherein said synthetic bioactive protein is selected 
from the group consisting of SEP-0, SEP-1, and SEP-3. 

20 52. The method of claim 48, wherein said cytokine is a RANTES protein. 

53. The method of claim 48, wherein said cytokine is a growth factor. 

54- The method of claim 53, wherein said growth factor is G-CSF. 
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55. A method for synthesizing a desired polypeptide of formula: 

aaNH2-Q"aax-aay-W-aacxx)H 
where Q and W each denote the optional presence of one or more 
additional amino acid residues, aaNH^ denotes the N-terminal amino acid 

5 residue of the polypeptide; aax and aay denote internal adjacent amino acid 

residues, havmg side chains x and y, respectively, and aacooH denotes the C- 
terminal amino acid residue of the polypeptide; 
said method comprising: 

(A) ligating a first peptide having the formula: aa^Hj'Q'^x-COSR, 

10 wherein R is any group compatible with the thioester group, 

including, but not limited to, aiyl, benzyl, and alkyl groups, to a 
second peptide having the formula: Cys-W-aacooH to thereby form 
the polypeptide: aaNH^^Q'^^x-'Cys-W-aacooH* and 

(B) incubating said polypeptide in the presence of a reagent Raa-X, where 
15 Raa is a group whose structure mimics the terminal portion of the side 

chain of amino acid residue aay; and X is a good leaving group; said 
incubation being under conditions sufficient to form said desired 
polypeptide. 

56. The method of claim 55, wherein X is a halogen. 

20 57. The method of claim 56, wherein said halogen is F, I or Br. 

58. The method of claim 55, wherein said amino acid residue aay is selected 
fix)m the group consisting of a pseudo-arginine; a pseudo-asparagine; a 
pseudo-aspartate; a pseudo-dopamine; a pseudo-glutamate; a pseudo- 
glutamine; a pseudo-histidine; a pseudo-isoleucine; a pseudo-leucine; a 
25 pseudo-lysine; a pseudo-methionine; a pseudo-phenyalanine; a pseudo- 

serine; a pseudo-threonine; a pseudo-tryptophan; a pseudo-tyrosine; and a 
pseudo-valine. 
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' A polypeptide of formula: 

aaNH^-Q-aax-aay-W-aacooH 

where Q and W each denote the optional presence of one or more 
additional amino acid residues, aa^Hj denotes the N-terminal amino acid 
residue of the polypeptide; aa^ and aay denote internal adjacent amino acid 
residues, having side chains x and y, respectively, and aacooH denotes tfie C- 
terminal amino acid residue of the polypeptide; 

wherein said polypeptide is produced by a method comprising: 

(A) ligatmg a first peptide having the formula: aaNH^-Q-aax-COSR, 
wherem R is any group compatible with the thioester group, 
including, but not limited to, aryl, benzyl, and alkyl groups, to a 
second peptide having the formula: Cys-W-aacooH to thereby form 
the polypeptide: aaNH2-Q-aax-Cys-W-aacooH; and 

(B) incubating said polypeptide in the presence of a reagent Raa-X, where 
Raa is a group whose structure mmiics the terminal portion of the side 
chain of amino acid residue aay; and X is a good leaving group; said 
incubation being under conditions suflBcient to form said desired 
polypeptide. 

The polypeptide of claim 59, wherein X is a halogen. 

The polypeptide of claim 60, wherein said halogen is F, I or Br. 

The polypeptide of claim 59, wherein said amino acid residue aay is selected 
from the group consisting of a pseudo-arginine; a pseudo-asparagine; a 
pseudo-aspartate; a pseudo-dopamine; a pseudo-glutamate; a pseudo- 
glutamine; a pseudo-histidine; a pseudo-isoleucine; a pseudo-leucine; a 
pseudo-lysine; a pseudo-methionme; a pseudo-phenyalanine; a pseudo- 
serine; a pseudorthreonme; a pseudo-tryptophan; a pseudo-tyrosine; and a 
pseudo-valine. 
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Figure 1 
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Figure 2 
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Figure 3 
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Figure 4 
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Figure 5 
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Figure? 
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SEQUENCE LISTING 

<110> Gryphon Sciences 

5 <120> "Pseudo "-Native Chemical Ligation 

<130> 03504.268 

<140> 
10 <141> 

<150> 60/231,330 
<151> 2000-09-08 

15 <150> 60/236,377 
<151> 2000-09-29 

<160> 3 

20 <170> Patentin Ver. 2.1 

<210> 1 
<211> 166 
<212> PRT 
25 <213> Homo sapiens 

<400> 1 

Ala Pro Pro Arg Leu lie Cys Asp Ser Arg Val Leu Glu Arg Tyr Leu 
15 10 15 



30 



45 



Leu Glu Ala Lys Glu Ala Glu Lys lie Thr Thr Gly Cys Ala Glu His 
20 25 30 



Cys Ser Leu Asn Glu Lys lie Thr Val Pro Asp Thr Lys Val Asn Phe 
35 35 40 45 

Tyr Ala Trp Lys Arg Met Glu Val Gly Gin Gin Ala Val Glu Val Trp 
50 55 . 60 

40 Gin Gly Leu Ala Leu Leu Ser Glu Ala Val Leu Arg Gly Gin Ala Leu 
65 70 75 80 



Leu Val Lys Ser Ser Gin Pro Trp Cys Pro Leu Gin Leu His Val Asp 

85 90 95 

Lys Ala Val Ser Gly Leu Arg Ser Leu Thr Thr Leu Leu Arg Ala Leu 

100 105 110 



Gly Ala Gin Lys Cys Ala He Ser Pro Pro Asp Ala Ala Ser Ala Ala 
50 115 120 125 

Pro Leu Arg Thr He Thr Ala Asp Thr Phe Arg Lys Leu Phe Arg Val 

130 135 140 

55 Tyr Ser Asn Phe Leu Arg Gly Lys Leu Lys Leu Tyr Thr Gly Glu Ala 
145 150 155 160 



60 



Cys Arg Thr Gly Asp Arg 
165 
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<210> 2 
<211> 166 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Ala Pro Pro Arg Leu He Cys Asp Ser Arg Val Leu Glu Arg Tyr Leu 

15 10 15 

10 Leu Glu Ala Lys Glu Ala Glu Lys He Thr Thr Gly Cys Ala Glu His 
20 25 30 



15 



30 



40 



45 



55 



Cys Ser Leu Asn Glu Lys He Thr Val Pro Asp Thr Lys Val Asn Phe 
35 40 45 

Tyr Ala Trp Lys Arg Met Glu Val Gly Gin Gin Ala Val Glu Val Trp 
50 55 60 



Gin Gly Leu Ala Leu Leu Ser Glu Ala Val Leu Arg Gly Gin Ala Leu 
20 65 70 75 80 

Leu Val Lys Ser Ser Gin Pro Trp Cys Pro Leu Gin Leu His Val Asp 
85 90 95 

25 Lys Ala Val Ser Gly Leu Arg Ser Leu Thr Thr Leu Leu Arg Ala Leu 
100 105 110 



Gly Ala Gin Lys Cys Ala He Ser Pro Pro Asp Ala Ala Lys Ala Ala 
115 120 125 

Pro Leu Arg Thr He Thr Ala Asp Thr Phe Arg Lys Leu Phe Arg Val 
130 135 140 



Tyr Ser Asn Phe Leu Arg Gly Lys Leu Lys Leu Tyr Thr Gly Glu Ala 
35 145 150 155 160 



Cys Arg Thr Gly Asp Arg 
165 



<210> 3 
<211> 166 
<212> PRT 

<213> Homo sapiens 



<400> 3 

Ala Pro Pro Arg Leu He Cys Asp Ser Arg Val Leu Glu Arg Tyr Leu 

1 5 .10 15 

50 Leu Glu Ala Lys Glu Ala Glu Cys He Thr Thr Gly Cys Ala Glu His 

20 25 30 

Cys Ser Leu Asn Glu Cys He Thr Val Pro Asp Thr Lys Val Asn Phe 

35 40 45 



Tyr Ala Trp Lys Arg Met Glu Val Gly Gin Gin Ala Val Glu Val Trp 
50 55 60 



Gin Gly Leu Ala Leu Leu Ser Glu Ala Val Leu Arg Gly Gin Ala Leu 
60 65 70 75 80 
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Leu Ala Cys Ser Ser Gin Pro Trp Cys Pro Leu Gin Leu His Val Asp 
85 90 95 

Lys Ala Val Ser Gly Leu Arg Ser Leu Thr Thr Leu Leu Arg Ala Leu 
5 100 . 105 110 

Gly Ala Gin Lys Cys Ala lie Ser Pro Pro Asp Ala Ala Cys Ala Ala 
115 120 125 

10 Pro Leu Arg Thr lie Thr Ala Asp Thr Phe Arg Lys Leu Phe Arg Val 
130 135 140 

Tyr Ser Asn Phe Leu Arg Gly Lys Leu Lys Leu Tyr Thr Gly Glu Ala 
145 150 155 160 

15 

Cys Arg Thr Gly Asp Arg 
165 
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